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The number after each entry refers to a page where the symbol is explained in the text. 
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Q**, Rtt 


bla 
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la, b] 


(a), dQ, . 
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Za 


1p 
MR) 


MZ), M(Q), 
M(C), M(Z,) 


0 

M(R) 

R=S 

(c) 

(Cig Casita eh) 
a = b (mod I) 
a+I 

RJI 

I+J 

IJ 
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F(x) 
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deg f(x) 

Sx) |e) 

S(x) = g(x)(mod p(x) 
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Fix) 
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Polynomials 
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PREFACE 


This book is intended for a first undergraduate course in modern abstract algebra. 
Linear algebra is not a prerequisite. The flexible design makes the text suitable for 
courses of various lengths and different levels of mathematical sophistication, in- 
cluding (but not limited to) a traditional abstract algebra course, or one with a more 
applied flavor, or a course for prospective secondary school teachers. As in previous 
editions, the emphasis is on clarity of exposition and the goal is to produce a book that 
an average student can read with minimal outside assistance. 


New in the Third Edition 


Groups First Option Those who believe (as I do) that covering rings before groups 
is the better pedagogical approach to abstract algebra can use this edition exactly as 
they used the previous ones. 

Nevertheless, anecdotal evidence indicates that some instructors have used the sec- 
ond edition for a “groups first” course, which presumably means that they liked other 
aspects of the book enough that they were willing to take on the burden of adapting it to 
their needs. To make life easier for them (and for anyone else who prefers “groups first”) 


It is now possible (though not necessary) to use this text for 
a course that covers groups before rings. 


See the TO THE INSTRUCTOR section for details. 

Much of the rewriting needed to make this option feasible also benefits the “rings 
first” users. A number of them have suggested that complete proofs were needed in 
parts of the group theory chapters instead of directions that said in effect “adapt the 
proof of the analogous theorem for rings”. The full proofs are now there. 


Proofs for Beginners Many students entering a first abstract algebra course have 
had little (or no) experience in reading and wnting proofs. To assist such students (and 
better prepared students as well), a number of proofs (especially in Chapters 1 and 2) 
have been rewritten and expanded. They are broken into several steps, each of which 
is carefully explained and proved in detail. Such proofs take up more space, but I think 
it’s worth it if they provide better understanding. 

So that students can better concentrate on the essential topics, various items from 
number theory that play no role in the remainder of the book have been eliminated 
from Chapters 1 and 2 (though some remain as exercises). 


ix 


(Copyright 2012 Casgnas Leeming. All fight Raerved Mey not be copied, weaned. or Gyticatad, in whale or im part. Dus to electronic dghtr, mmo third party comes? may be mppmæai fine the eBouk exdVor eChagter(s). G4dorial review has 
drecnad that may sapgresned oosid does ont matrcialty afibct the oemf (emroing aperia Cengage Learning servea the right to reaepve efiditicnal codas at any Gane if suhanyurat rights cemtricthms require it. 


x Preface 


More Examples and Exercises In the core course (Chapters 1-8), there are 35% 
more examples than in the previous edition and 13% more exercises. Some older exer- 
cises have been replaced, so 18% of the exercises are new. The entire text has about 350 
examples and 1600 exercises. For easier reference, the examples are now numbered. 


Coverage The breadth of coverage in this edition is substantially the same as in 
the preceding ones, with one minor exception. The chapter on Lattices and Boolean 
Algebra (which apparently was rarely used) has been eliminated. However, it is avail- 
able at our website (www.CengageBrain.com) for those who want to use it. 

The coverage of groups is much the same as before, but the first group theory chapter 
in the second edition (the longest one in the book by far) has been divided into two chap- 
ters of more manageable size. This arrangement has the added advantage of making the 
parallel development of integers, polynomials, groups, and rings more apparent. 


Endpapers The endpapers now provide a useful catalog of symbols and notations. 


Website The website (www.CengageBrain.com) provides several downloadable 
programs for TI graphing calculators that make otherwise lengthy calculations in 
Chapters 1 and 14 quite easy. It also contains a chapter on Lattices and Boolean 
Algebra, whose prerequisites are Chapter 3 and Appendices A and B. 


Continuing Features 


Thematic Development The Core Course (Chapters 1-8) is organized around two 
themes: Arithmetic and Congruence. The themes are developed for integers (Chapters 1 
and 2), polynomials (Chapters 4and 5), rings (Chapters 3 and 6), andgroups (Chapters 7 
and 8). See the Thematic Table of Contents in the TO THE STUDENT section for a 
fuller picture. 


Congruence The Congruence theme is strongly emphasized hi the development of 
quotient rings and quotient groups. Consequently, students can see more clearly that 
ideals, normal subgroups, quotient rings, and quotient groups are simply an extension 
of familiar concepts in the integers, rather than an unmotivated mystery. 


Useful Appendices These contain prerequisite material (e.g., logic, proof, sets, 
functions, and induction) and optional material that some instructors may wish to 
introduce (e.g., equivalence relations and the Binomial Theorem). 
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TO THE INSTRUCTOR 


Here are some items that will assist you in making up your syllabus. 


Course Planning 


Using the chart on the opposite page, the Table of Contents (in which optional sections 
are marked), and the chapter introductions, you can easily plan courses of varying length, 
emphasis, and order of topics. If you plan to cover groups before rings, please note that 
Section 7.1 should be replaced by Section 7.1. A (which appears immediately after 7.1). 


Appendices 


Appendix A (Logic and Proof) is a prerequisite for the entire text. Prerequisites for 
various parts of the text are in Appendices B-F. Depending on the preparation of 
your students and your syllabus, you may want to incorporate some of this material 
into your course. Note the following. 
« Appendix B (Sets and Functions): The middle part (Cartesian 
products and binary operations) is first used in Section 3.1 [7.1.A].* The last 
five pages (injective and surjective functions) are first used in Section 3.3 [7.4]. 


* Appendix C (Induction): Ordinary induction (Theorem C.1) is first used 
in Section 4.4. Complete Induction (Theorem C.2) is first used in Section 4.1 
[9.2]. The equivalence of induction and well-ordering (Theorem C.4) is not 
needed in the body of the text. 

« Appendix D (Equivalence Relations): Important examples of 
equivalence relations are presented in Sections 2.1, 5.1, 6.1, and 8.1, but the 
formal definition is not needed until Section 10.4 [9.4]. 

e Appendix E (The Binomial Theorem): This is used only in Section 
11.6 and occasional exercises earlier. 

* Appendix F (Matrix Algebra): This is a prerequisite for Chapter 16 but 
is not needed by students who have had a linear algebra course. 


Finally, Appendix G presents a formal development of polynomials and indetermi- 
nates. I personally think it’s a bit much for beginners, but some people like it. 


Exercises 


The exercises in Group A involve routine calculations or short straightforward proofs. 
Those in Group B require a reasonable amount of thought, but the vast majority 
should be accessible to most students. Group C consists of difficult exercises. 

Answers (or hints) for more than half of the odd-numbered exercises are given 
at the end of the book. Answers for the remaining exercises are in the Instructor’s 
Manual available to adopters of the text. 


xii *The section numbers in brackets are for groups-first courses. 
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To the Instructor xiii 


CHAPTER INTERDEPENDENCE* 


L 
Arithmetic 
inZ 


13. 


14.1 
Chinese Remainder |*4-------------- Publio-Key 
Theorem (CRT) Cryptography 
14.2 
Applications |~—-——~-——— 
of the CRT 
15. 
Geometric ---- 
Constructions 
14.3 16.1, 16.2 
The CRT Algebraic 
for Rings Coding 


Theory 


16.3 


NOTE: To go quickly from Chapter 3 to Chapter 6, first cover Section 4.1 (except the 
proof of the Division Algorithm), then proceed to Chapter 6. If you plan to cover 
Chapter 11, however, you will need to cover Chapter 4 first. 


*A solid arrow A—->8 means that A is a prerequisite for 8; a dashed arrow A->B means that B depends 
only on parts of A (see the Table of Contents for specifics). For the dotted arrow 3--»6, see the Note 
at the bottom of the chart. 
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TO THE STUDENT 


Overview 


This book begins with grade-school arithmetic and the algebra of polynomials from 
high school (from a more advanced viewpoint, of course). In later chapters of the 
book, you will see how these familiar topics fit into a larger framework of abstract 
algebraic systems. This presentation is organized around these two themes: 


Arithmetic You will see how the familiar properties of division, remainders, factor- 
ization, and primes in the integers carry over to polynomials, and then to more general 
algebraic systems. 


Congruence You may be familiar with “clock arithmetic”.* This is an example of 
congruence and leads to new finite arithmetic systems that provide a model for what 
can be done for polynomials and other algebraic systems. Congruence and the related 
concept of a quotient object are the keys to understanding abstract algebra. 


Proofs 


The emphasis in this course, much more than in high-school algebra, is on the rigor- 
ous logical development of the subject. If you have had little experience with reading 
or writing proofs, you would do well to read Appendix A, which summarizes the basic 
rules of logic and the proof techniques that are used throughout the book. 

You should first concentrate on understanding the proofs in the text (which is quite 
different from constructing a proof yourself). Just as you can appreciate a new build- 
ing without being an architect or a contractor, you can verify the validity of proofs 
presented by others, even if you can't see how anyone ever thought of doing it this way 
in the first place. 

Begin by skimming through the proof to get an idea of its general outline before 
worrying about the details in each step. It’s easier to understand an argument if you 
know approximately where it’s headed. Then go back to the beginning and read the 
proof carefully, line by line. If it says “such and such is true by Theorem 5.18”, check 
to see just what Theorem 5.18 says and be sure you understand why it applies here. If 
you get stuck, take that part on faith and finish the rest of the proof. Then go back and 
see if you can figure out the sticky point. 


“When the hour hand of a clock moves 3 hours or 15 hours from 12, it ends in the same position, so 
3 = 15 on the clock. If the hour hand starts at 12 and moves 8 hours, then moves an additional 
9 hours, it finishes at 5; so 8 + 9 = 5 on the clock. 

xiv 
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To the Student xv 


When you're really stuck, ask your instructor. He or she will welcome questions that 
arise from a serious effort on your part. 


Exercises 


Mathematics is not a spectator sport. You can’t expect to learn mathematics without 
doing mathematics, any more than you could learn to swim without getting in the 
water. That’s why there are so many exercises in this book. 

The exercises in group A are usually straightforward. If you can’t do almost all of 
them, you don’t really understand the material. The exercises in group B often require 
a reasonable amount of thought—and for most of us, some trial and error as well. But 
the vast majority of them are within your grasp. The exercises in group C are usually 
difficult . . . a good test for strong students. 

Many exercises will ask you to prove something. As you build up your skill in un- 
derstanding the proofs of others (as discussed above), you will find it easier to make 
proofs of your own. The proofs that you will be asked to provide will usually be much 
simpler than proofs in the text (which can, nevertheless, serve as models). 

Answers (or hints) for more than half of the odd-numbered exercises are given at 
the back of the book. 


Keeping It All Straight 


In the Core Course (Chapters 1-8), students often have trouble seeing how the various 
topics tie together, or even if they do. The Thematic Table of Contents on the next two 
pages is arranged according to the themes of arithmetic and congruence, so you can 
see how things fit together. 
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Primality Testing 1.3 Theorem 1.10 4.4 Polynomial Functions, 
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Thematic Table of Contents for the Core Course xvii 


Directions: Reading from left to right across these two pages shows how the theme or 
subtheme in the left-hand column is developed in the four algebraic systems listed in the 
top row. Each vertical column shows how the themes are carried out for the system listed 
at the top of the column. 


RINGS* GROUPS* 


3. Rings 
3.1 Rings 


7. Groups 
7.1 Definition and Examples of Groups 
7.5 The Symmetric and Alternating Groups 


3.2 Basic Properties of Rings 7.2 Basic Properties of Groups 


7.3 Subgroups 


8. Normal Subgroups and Quotient Groups 
8.1 Congruence 
8.2 Normal Subgroups 
8.5 The Simplicity of A, 


6. Ideals and Quotient Rings 
6.1 Ideals and Congruence 


6.2 Quotient Rings and 
Homomorphisms 


8.3 Quotient Groups 
8.4 Quotient Groups and Homomorphisms 


6.3 The Structure of R/I When IIs 
Prime or Maximal 


3.3 Isomorphisms and 7.4 Isomorphisms and Homomorphisms 
Homomorphisms 


*in the Arithmetic Theme, the sections of Chapters 3 (Rings) and 8 (Groups) do not correspond to the individual 
subthemes (as do the sections of Chapters 1 and 4). For integral domains, however, there is a correspondence, as 
you will see in Chapter 10 (Arithmetic in Integral Domains). 
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PART 


THE CORE COURSE 
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CHAPTER ] 


Arithmetic in Z Revisited 


Algebra grew out of arithmetic and depends heavily on it. So we begin our study of 
abstract algebra with a review of those facts from arithmetic that are used frequently 
in the rest of this book and provide a model for much of the work we do. We stress 
primarily the underlying pattern and properties rather than methods of computation. 
Nevertheless, the fundamental concepts are ones that you have seen before. 


[aa The Division Algorithm 


Our starting point is the set of all integers Z = {0, +1, +2, . . .}. We assume that you 
are familiar with the arithmetic of integers and with the usual order relation (<) on 
the set Z. We also assume the 


WELL-ORDERING AXIOM Every nonempty subset of the set of nonnegative 
integers contains a smallest element. 


If you think of the nonnegative integers laid out on the usual number line, it is 
intuitively plausible that each subset contains an element that lies to the left of all the 
other elements in the subset—that is the smallest element. On the other hand, the Well- 
Ordering Axiom does not hold in the set Z of all integers (there is no smallest negative 
integer). Nor does it hold in the set of all nonnegative rational numbers (the subset of 
all positive rationals does not contain a smallest element because, for any positive ratio- 
nal number r, there is always a smaller positive rational—for instance, r/2). 


NOTE: The rest of this chapter and the next require Theorem 1.1, which 
is stated below. Unfortunately, its proof is a bit more complicated than 
is desirable at the beginning of the course, since some readers may not 
have seen many (or any) formal mathematical proofs. To alleviate this 
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4 Chapter 1 Arithmetic in Z Revisited 


situation, we shall first look at the origins of Theorem 1.1 and explain the 
idea of its proof. Unless you have a strong mathematical background, we 
suggest that you read this additional material carefully before beginning 
the proof. 

To ease the beginner’s way, the proof itself will be broken into several 
steps and given in more detail than is customary in most books. However, 
because the proof does not show how the theorem is actually used in prac- 
tice, some instructors may wish to postpone the proof until the class has 
more experience in proving results. In any case, all students should at least 
read the outline of the proof (its first three lines and the statements of 
Steps 1-4). 


So here we go. Consider the following grade-school division problem: 


Quotient ——> 11 Check: 11 «<— Quotient 
Divisor —>7)82 X7 <— Divisor 
Dividend J. 77 
12 +5 <— Remainder 
7 82 <—— Dividend 


Remainder —> 5 


The division process stops when we reach a remainder that is less than the divisor. 
All the essential facts are contained in the checking procedure, which may be verbally 
summanized like this: 


dividend = (divisor) (quotient) + (remainder). 


Here is a formal statement of this idea, in which the dividend is denoted by a, the 
divisor by b, the quotient by q, and the remainder by r: 


Theorem 1.1 The Division Algorithm 


Let a, b be integers with b > 0. Then there exist unique integers q and r such 
that 


a=bq+r and Osr<b. 

Theorem 1.1 allows the possibility that the dividend a might be negative but re- 
quires that the remainder r must not only be less than the divisor b but also must be 
nonnegative. To see why this last requirement is necessary, suppose a = —14 is divided 
by b = 3, so that —14 = 3q + r. If we only require that the remainder be less than 
the divisor 3, then there are many possibilities for the quotient g and remainder r, 
including these three: 

—14 = 3(-3) + (5), with -5 < 3 [Here q = —3 andr = —5.] 
-14 = 3(-4) + (2), with-2<3 [Hereq = —4 andr = -2.] 
—14 = 3(-5) + 1, with 1<3 [Here q = —S andr = 1.]. 
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1.1 The Division Algorithm 5 


When the remainder is also required to be nonnegative as in Theorem 1.1, then there 
is exactly one quotient q and one remainder r, namely, q = —5 and r = 1, as will be 
shown in the proof. 

The fundamental idea underlying the proof of Theorem 1.1 is that division is just 
repeated subtraction. For example, the division of 82 by 7 is just a shorthand method 
for repeatedly subtracting 7: 


82 

-7 

75 — 82 -7:1 40 

=i = 

68 <—82-7:2 33 — 82 -7-7 
-7 -7 

61 — 82 -7'3 26 4—82 -7:8 
-7 -7 

54 <— 82 -7'4 19 — 82 - 7:9 
-7 -7 

47 <—82-7°5 12 <— 82 -7-10 
-7 -7 

40 4—82 -7'6 -5 «— 82 - 7: 11 


The subtractions continue until you reach a nonnegative number less than 7 (in this 
case 5). The number 5 is the remainder, and the number of multiples of 7 that were 
subtracted (namely, 11, as shown at the right of the subtractions) is the quotient. 

In the preceding example we looked at the numbers 


82 -7-1, 82—7-2, 82-7: 3,and so on. 


In other words, we looked at numbers of the form 82 — 7x for x = 1, 2, 3,... and 
found the smallest nonnegative one (namely, 5). In the proof of Theorem 1.1 we shall 
do something very similar. 


Proof of Theorem 1.1* > Let a and b be fixed integers with b > 0. Consider the set S 
of all integers of the form 


a — bx, where x is an integer and a — bx = 0. 
Note that x may be any integer—positive, negative, or O—but a — bx must 
be nonnegative. There are four main steps in the proof, as indicated below. 
Step 1 Show that S is nonempty by finding a value for x such that a — bx = 0. 


Proof of Step I: We first show that a + b|a| = 0. Since b is a positive 
integer by hypothesis, we must have 


b21 
bla] = |a| [Multiply both sides of the preceding inequality by |a\.] 
bla| =—a [Because |a| = -a by the definition of absolute value.) 
a+ bja| = 0. 


*For an alternate proof by induction of part of the theorem, see Example 2 in Appendix C. 
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6 Chapter 1 Arithmetic in Z Revisited 


Now let x = —|a|. Then 
a-—bx=a-—b(-|al) =a + dlal=0. 


Hence, a — bx is in S when x = —|a], which means that S is nonempty. 


Step 2 Find q and r such thata = bq + randr= 0. 


Proof of Step 2: By the Well-Ordering Axiom, S contains a smallest 
element—call it r. Since r € S, we know that r = 0 andr = a — bx for 
some x, say x = q. Thus, 


r=a ~ bą and r=0, or, equivalently, a=bg+randr=O0. 


Step 3 Show that r < b. 


Proof of Step 3: We shall use a “proof by contradiction” (which is 
explained on page 506 of Appendix A). We want to show that r < b. 
So suppose, on the contrary, that r = b. Then r — b = 0, so that 


0sr-b=(a-bq)-b=a-bq+!). 


Since a — b(q + 1) is nonnegative, it is an element of S by definition. But 
since b is positive, it is certainly true that r — b < r. Thus 


a-bq+l=r-b<r. 


The last inequality states that a — b(q + 1)}—which is an element of 
S—is less than r, the smallest element of S. This is a contradiction. 
So our assumption that r = b is false, and we conclude that r < b. 
Therefore, we have found integers q and r such that 


a=bqt+r and Osr<b. 


Step 4 Show that r and q are the only numbers with these properties (that’s what 
“unique” means in the statement of the theorem). 


Proof of Step 4: To prove uniqueness, we suppose that there are integers 
qı and rı such that a = bg, + ri andO = r, < b, and prove that q; = q 
andr, =r. 

Since a = bg + rand a = bq, + r,, we have 


bq +r = bq +r 


so that 
(*) bq-a)=n-r. 
Furthermore, 
Osr<b 
Osn<b. 
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1.14 The Division Algorithm 7 


Multiplying the first inequality by — 1 (and reversing the direction of the 
inequality), we obtain 
-b<-rsod 
0s fi < b. 


Adding these two inequalities produces 


—-b<rn—r<b 
—b<b(g-—q)<5 [By Equation (*)] 
-l<q-q,<1 [Divide each term by b] 


But q — qı is an integer (because q and q; are integers) and the only 
integer strictly between —1 and 1 is 0. Therefore q — q, = Oand q = q). 
Substituting q — qı = 0 in Equation (*) shows that r, — r = 0 and 
hence r = r. Thus the quotient and remainder are unique, and the 
proof is complete. m* 


When both the dividend a and the divisor b in a division problem are positive, then 
the quotient and remainder are easily found either by long division (as on page 4) or 
with a calculator when the integers involved are larger. 


EXAMPLE 1 


Suppose a = 4327 is divided by b = 281. Entering a/b in a calculator produces 
15.39857 - - - . The integer to the left of the decimal point (15 here) is the quo- 
tient q and the remainder is 


r = a — bq = 4327 — 281-15 = 112. 


These calculations are shown on the graphing calculator screen in Figure 1. 


277281 
15. 39857651 
4327-281 +15 


FIGURE 1 


When the dividend a is negative, a slightly different procedure is needed so that the 
remainder will be nonnegative. 
“The symbol W indicates the end of a proof. 
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EXAMPLE 2 


Suppose a = ~—7432 is divided by b = 453. Entering a/b ina calculator pro- 
duces — 16.40618 - - - . In this case the quotient q is not — 16; instead, 


= (the integer to the left of the decimal point) -1 = —16 — 1 = —17. 


(Without this adjustment, you will end up with a negative remainder.) Now, as 
usual, 
r = a ~ bq = ~7432 — 453: (-17) = 269. 


The preceding calculations are summarized in the calculator screen in Figure 2. 


-74327453 
-16. 40618102 
-7432-453*¢ -17 


FIGURE 2 


E Exercises 


A. In Exercises 1 and 2, find the quotient q and remainder r when a is divided by b, 
without using technology. Check your answers. 


1. (a) a= 17;5=4 (b) a = 0;b = 19 (c) a= —17;b=4 
2. (a) a= —51;5 = 6 (b) a = 302; b = 19 (c) a = 2000; b = 
In Exercises 3 and 4, use a calculator to find the quotient q and remainder r when 
a is divided by b. 
3. (a) a = 517; b = 83 (b) a = —612; b = 74 
(c) a = 7,965,532; b = 127 
4. (a) a = 8,126,493; b = 541 (b) a = —9,217,645; b = 617 


(c) a = 171,819,920; b = 4321 
5. Let a be any integer and let b and c be positive integers. Suppose that when 
ais divided by b, the quotient is q and the remainder is r, so that 
a=bqtr and Osr<b. 
If ac is divided by bc, show that the quotient is q and the remainder is re. 


B. 6. Leta, b, c, and q be as in Exercise 5. Suppose that when q is divided by c, the 
quotient is k. Prove that when a is divided by bc, then the quotient is also k. 


7. Prove that the square of any integer a is either of the form 3k or of the 
form 3k + 1 for some integer k. [Hint: By the Division Algorithm, a must 
be of the form 3q or 3q + 1 or 3g +2] 
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1.2 Divisibility 9 


8. Use the Division Algorithm to prove that every odd integer is either of the 
form 4k + 1 or of the form 4k + 3 for some integer k. 


9. Prove that the cube of any integer a has to be exactly one of these forms: 9k 
or 9k + 1 or 9k + 8 for some integer k. [Hint: Adapt the hint in Exercise 7, 
and cube a in each case.] 


10. Let z be a positive integer. Prove that a and ¢ leave the same remainder when 
divided by n if and only if a — c = nk for some integer k. 


11. Prove the following version of the Division Algorithm, which holds for both 
positive and negative divisors. 


Extended Division Algorithm, Let a and b be integers with b # 0. Then there 
exist unique integers q and r such that a = bq + randO sr < |b|. 


[Hint: Apply Theorem 1.1 when a is divided by |b|. Then consider two cases 
(b > Oand b< 0)] 


i] Divisibility 


An important case of division occurs when the remainder is 0, that is, when the divisor 
is a factor of the dividend. Here is a formal definition: 


Definition Let a and b be integers with b + 0. We say that b divides a (or that b is a divi- 
sor of a, or that b is a factor of a) if a = bc for some integer c. In symbols, “b 
divides a" is written b |a and "b does not divide a” is written b fa. 


EXAMPLE 1 


3 | 24 because 24 = 3 - 8, but 3} 17. Negative divisors are allowed: —6 | 54 
because 54 = (—6)(—9), but —6 ¥ (~13). 


EXAMPLE 2 


Every nonzero integer b divides 0 because 0 = b : 0. For every integer a, we 
have 1|a because a = | «a. 


Remark If b divides a, then a = bc for some c. Hence —a = b(—c}, so that 
b|(~a). An analogous argument shows that every divisor of —a is also a divisor of a. 
Therefore 

a and —a have the same divisors. 


Remark Supposea # 0and d| a. Thena = bc, so that |a|= jbl |e]. Consequently, 
0 = |b| = ja|. This last inequality is equivalent to — la| = b = jaj. Therefore 


(i) every divisor of the nonzero integer a is less than or equal to | a |; 


(ii) a nonzero integer has only finitely many divisors. 
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10 Chapter1 Arithmetic in Z Revisited 


All the divisors of the integer 12 are 
1, —1, 2, —2, 3, —3, 4, —4, 6, ~6, 12, -12. 
Similarly, all the divisors of 30 are 
Lys 125-2 28'35, 235551 - S16; 6510) = 10; 15; = 155305: 30: 


The common divisors of 12 and 30 are the numbers that divide both 12 and 30, that 
is, the numbers that appear on both of the preceding lists: 


Ll, 2,12, 3-36, =E 


The largest of these common divisors, namely 6, is called the “greatest common 
divisor” of 12 and 30. This is an example of the following definition. 


Definition Leta and b be integers, not both 0. The greatest common divisor (gcd) of 
a and b is the largest integer ¢ that divides both a and b. In other words, 
dis the g¢d of a and b provided that 


(1) d|aand d|d; 
(2) ifc]aand c|b, then c = d. 


The greatest common divisor of a and b is usually denoted (a, b). 


If a and b are not both 0, then their gcd exists and is unique. The reason is that 
a nonzero integer has only finitely many divisors, and so there are only a finite num- 
ber of common divisors. Hence there must be a unique largest one. Furthermore, the 
greatest common divisor of a and b satisfies the inequality 


(a,b) = 1 


because | is a common divisor of a and b. 


EXAMPLE 3 


(12, 30) = 6, as shown above. The only common divisors of 10 and 21 are 1 and 
—1. Hence (10, 21) = 1. Two integers whose greatest common divisor is 1, such 
as 10 and 21, are said to be relatively prime. 


EXAMPLE 4 


The common divisors of an integer a and 0 are just the divisors of a. If a > 0, 
then the largest divisor of a is clearly a itself. Hence, if a >-0, then (a, 0) = a. 
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1.2 Divisibility 11 


Listing all the divisors of two integers in order to find their gcd can be quite time 
consuming. However, the Euclidean Algorithm (Exercise 15) is a relatively quick 
method for finding gcd’s by hand. You can also use technology. 


Technology Tip: To find a gcd on a Ti-graphing calculator, select “god” in the 
NUM submenu of the MATH menu. 


We have seen that 6 = (12, 30). A little arithmetic shows that something else is true 
here: 6 is a linear combination of 12 and 30. For instance, 


6 = 12(-2)+ 30(1) and 6 = 12(8) + 30(—3). 


You can readily find other integers u and v such that 6 = 12u + 30v. The following 
theorem shows that the same thing is possible for any greatest common divisor. 


Theorem 1.2 


Let a and 6 be integers, not both 0, and let d be their greatest common divi- 
sor. Then there exist (not necessarily unique) integers u and y such that 
d = au + by. 


CAUTION: Read the theorem carefully. The fact that d = au + bv does 
not imply that d = (a, b). See Exercise 25. 


For the benefit of inexperienced readers, the proofs of Theorem 1.2 and 
Corollary 1.3 will be broken into several steps. The basic idea of the proof of 
Theorem 1.2 is to look at all possible linear combinations of a and b and find one 
that is equal to d. 


Proof of Theorem 1.2 > Let S be the set of all linear combinations of a and b, that is 
S= {am + bn|m,n € 8}. 


Step 1 Find the smallest positive element of S. 


Proof of Step 1: Note that a’ + b? = aa + bb is in Sand a’ + b = 0. 
Since a and b are not both 0, a? + b? must be positive. Therefore S 
contains positive integers and hence must contain a smallest positive 
integer by the Well-Ordering Axiom. Let t denote this smallest positive 
element of S. By the definition of S, we know that t = au + bv for 
some integers u and v. 
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Step 2 Prove that t is the gcdof a and b, that is, t= d. 


Proof af Step 2: We must prove that t satisfies the two conditions in the 
definition of the gcd: 


(1) t|a and t|b; 
(2) If c|aandc|b, then c = t. 


Proof of (1): By the Division Algorithm, there are integers q and r 

such that a = tq + r, with Os r < t. Consequently, 

r=a-— iq, 

r =a ~ (au + bu)qg =a — aqu — bug, 

r = a(l — qu) + b(—g) 
Thus r is a linear combination of a and b, and hence r € S. Since 
r < t (the smallest positive element of S), we know that r is not 
positive. Since r = 0, the only possibility is that r = 0. Therefore, 


a = tq + r= tq + 0 = tq, so that t |a. A similar argument shows 
that t|b. Hence, t is a common divisor of a and b. 


Proof of (2): Let c be any other common divisor of a and b, so that 
c |a and c|b. Then a = ck and b = cs for some integers k and s. 
Consequently, 


t = au + bv = (ck)u + (es)v 
= c(ku + sv). 


The first and last terms of this equation show that c |t. Hence, 
c S|t|by the second Remark on page 9. But tis positive, so |t| = t. 
Thuse St. 
This shows that ¢ is the greatest common divisor d and completes 
the proof of the theorem. m 


Technology Tip: To find the gcd of a and b and express it in the form au + bv on 
a TI calculator, download the GCD program on our website (www.CengageBrain 
.com). Figure 1 shows the result when you enter a = 2579 and b = 4321: The gcd 
is | and you can easily verify that 2579 - 826 + 4321 - (—493) = 1. 


AU+BV=GCD= i 


FIGURE 1 


To do the same thing with Maple, use the command igcdex(a, b, ‘s’, ‘t’);. 
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Corollary 1.3 


Let aand b be integers, not both 0, and let d be a positive integer. Then d is the 
greatest common divisor of a and b if and only if d satisfies these conditions: 


(i)d |a and d|b; 
(ii) if c [aand c| b, then c |d. 
Proof» The proof of an “if and only if” statement requires two steps 
(see page 507 in Appendix A). 
Step 1 Prove: If d= (a,b), then d satisfies conditions (i) and (ii). 
Proof of Step 1: If d = (a, b), then by the definition of the gcd, d divides 
both a and b. So d satisfies condition (i). 
To venfy that d satisfies condition (ii), suppose that c is an integer such 


that c |a and c|b. Then a = cr and b = cs for some integers r and s, by the 
definition of “divides”. By Theorem 1.2 there are integers u and v such that 


d=au + bu 
d=(crju+(cs)u [Because a = cr andb = ces] 
d = c(ru + sv) [Factor c out of both terms.] 


But this last equation says that c|d. Therefore, d satisfies condition (ii). 


Step 2 Prove. If dis a positive integer that satisfies conditions (i) and (ii), then 
d= (a,b). 
Proof of Step 2: To prove that d = (a, b), we must show that d satisfies 
the requirements of the definition of the gcd, namely, 


(1) d|a and d|b; 
(2) If c|a and c |b, then c = d. 


Obviously d satisfies (1) since requirement (1) and condition (i) are 
identical. To prove that d satisfies requirement (2), suppose c is an inte- 
ger that divides both a and b, then c| d by condition (ii). Consequently, 
by the second Remark on page 9, ¢ = |d|. But dis positive, so |d| = 
Thus, c = d. Therefore, d satisfies requirement (2) and, hence, d is the 
gcd of aand b. E 


The answer to the following question will be needed on several occasions. If a | bc, 
then under what conditions is it true that a |b or a|c? It is certainly not always true, as 
this example shows: 


6|3°4, but 643 and 644. 


Note that 6 has a nontrivial factor in common with 3 and another in common with 4. 
When a divisor of bc has no common factors (except +1) with either b or c, then there 
is a useful answer to the question. 
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Theorem 1.4 


If a|be and {a, b) = 1, then ajc. 


Proof > since (a, b) = 1, Theorem 1.2 shows that au + bv = 1 for some integers 
uand v. Multiplying this equation by c shows that acu + bcv = ¢. But 


a |bc, so that be = ar for some r. Therefore 
c = acu + bev = acu + (ar)v = a(cu + rv). 


The first and last parts of this equation show that a| c. 


E Exercises 


hand, but technology is OK for the rest. 

(a) (56, 72) (b) (24, 138) (e) (112, 57) 
(d) (143, 231) (e) (306, 657) (f) (272, 1479) 
(g) (4144, 7696) (h) (12378, 3054) 


2. Prove that b|a if and only if (—b) |a. 
3. If a|band b|c, prove that a|c. 


a» 


weet am 


12. 


13. 
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. (a) If a|b and ale, prove that a|(b + o). 


(b) If a|b and a|c, prove that a|(br + ct) for any r,t € Z. 


. If a and b are nonzero integers such that a |b and b |a, prove that a = +b. 


. If a|b and c|d, prove that ac | bd. 


If a < 0, find (a, 0). 


. Prove that (n, n + 1) = 1 for every integer n. 

. If a|c and b| c, must ab divide c? Justify your answer. 
. If (a, 0) = 1, what can a possibly be? 

. If n © Z, what are the possible values of 


(a) (%jn+2)  (b) (2+ 6) 


Suppose that (a, b) = 1 and (a, c) = 1. Are any of the following statements 


false? Justify your answers. 
(a) (@b,a)=1 (0) (,0=1 (c) (ab,c)=1 


. Find the greatest common divisors. You should be able to do parts (a)—(c) by 


Suppose that a, b, q, and r are integers such that a = bg + r. Prove each of the 


following statements. 


(a) Every common divisor c of a and b is also a common divisor of b and r. 


[Hint: For some integers s and ¢, we have a = es and b = ct. Substitute 


these results into a = bg + r, and show that c|r.] 


thet any Cppeeed cries does not matarialy affect the ovant lamiug apariat Cengage Lemming marves ths right to sarve widirionel content at esy tne if 


require i. 


1.2 Divisibility 15 


(b) Every common divisor of b and r is also a common divisor of a and b. 
(c) (a, b) = (b, r). 

14. Find the smallest positive integer in the given set. [Hint: Theorem 1.2.] 
(a) {6u + l5v|u,v E Z} (b) {12r + 17s|r,s € Z} 


15. The Euclidean Algorithm is an efficient way to find (a, b) for any positive 
integers a and b. It only requires you to apply the Division Algorithm 
several times until you reach the gcd, as illustrated here for (524, 148). 


(a) Verify that the following statements are correct. 


524 = 148-3 +80 0=380< 148 


wo Os [The divisor in each line becomes 
148 = 80-1 +68 0568 <80 the dividend in the next line, 
-* Pa and the remainder in each line 


80 =, 68-3 +12 0512<68 becomes the divisor in the next line] 


2 ia 


68 = 125+8 0=<8<12 


I= 8144 054<8 [As shown in part (b), the last 
ae nonzero remainder, namely 4, 
8= 42+0 is the gcd (a, b).] 


(b) Use part (a) and Exercises 13 and Example 4 to prove that 

(524, 148) = (148, 80) = (80, 68) = (68, 12) = (12, 8) = (8, 4) = (4, 0) = 4. 
Use the Euclidean Algorithm to find 
(c) (1003, 456) (d) (322, 148) (e) (5858, 1436) 


The equations in part (a) can be used to express the gcd 4 as a linear 
combination of 524 and 148 as follows. First, rearrange the first 5 equations in 
part (a), as shown below. 


80 = 524 — 148-3 (1) 
68 = 148 ~ 80 (2) 
12 = 80 — 68-3 (3) 
= 68 — 12-5 (4) 
=12-8 (5) 


(f) Equation (1) expresses 80 as a linear combination of 524 and 148. Use this 
fact and Equation (2) to write 68 as a linear combination of 524 and 148. 


(g) Use Equation (1), part (f), and Equation (3) to write 12 as a linear 
combination of 524 and 148. 


(b) Use parts (f) and (g) to write 8 as a linear combination of 524 and 148. 


(i) Use parts (g) and (h) to write the gcd 4 as a linear combination of 524 and 
148, as desired. 


(i) Use the method described in parts (f}-(i) to express the gcd in part (c) asa 
linear combination of 1003 and 456. 
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B. 16. 


17. 


31. 


Arithmetic in Z Revisited 


If (a, b) = d, prove that G Ae 1. [Hint: a = dr and b = ds for some 
integers r and s (Why?). So a/d = r and b/d = s and you must prove that 
(r,s) = 1. Apply Theorem 1.2 to (a, b) and divide the resulting equation by d.] 


Suppose (a, b) = 1. If a |c and b|c, prove that ab |c. [Hint: c = bt (Why?), so 
a|bt. Use Theorem 1.4.] 


. If c > 0, prove that (ca, cb) = c(a, b). [Hint: Let (a, b) = dand (ca, cb) = k. 


Show that cd|k and k | cd. See Exercise 5.] 


. If a|(b + c) and (b, c) = 1, prove that (a, b) = 1 = (a, o). 

. Prove that (a, b) = (a, b + at) for every t E Z. 

. Prove that (a, (b, c)) = ((a, b), c). 

. If (a, c) = 1 and (b, c) = 1, prove that (ab, c) = 1. 

. Use induction to show that if (a, b) = 1, then (a, 5") = 1 for all n = 1.* 


. Let a, b,c E Z. Prove that the equation ax + by = c has integer solutions if 


and only if (a, b)|c. 


. (a) If a, b, u,v E Z are such that au + bv = 1, prove that (a, b) = 1. 


(b) Show by example that if au + bv = d > 1, then (a, b) may not be d. 


. If a|c and b|c and (a, b) = d, prove that ab | cd. 
. If c| ab and (c, a) =-d, prove that c| db. 
. Prove that a positive integer is divisible by 3 if and only if the sum of its digits 


is divisible by 3. [Hint: 10? = 999 + 1 and similarly for other powers of 10.] 


. Prove that a positive integer is divisible by 9 if and only if the sum of its digits 


is divisible by 9. [See Exercise 28.] 


. If ay, az, ..., An are integers, not all zero, then their greatest common 


divisor (gcd) is the largest integer d such that d|a; for every i. Prove that 
there exist integers u; such that d = aju; + ayug + +++: + a,u [Hint: Adapt 
the proof of Theorem 1.2.] 


The least common multiple (lcm) of nonzero integers a4, a), ..., a is the 
smallest positive integer m such that a,|m for i = 1, 2,..., k and is denoted 


[a, &,..-, a). 
(a) Find each of the following: [6, 10], [4, 5, 6, 10], [20, 42], and [2, 3, 14, 36, 42]. 


(b) If ¢ is an integer such that a,|t for i= 1,2,...,, prove that 
[a1, d2, ~. ., A] t. [Hint: Denote [a;, a2,... , ay] by m. By the Division 
Algorithm, t = mq + r, with 0 = r < m. Show that a,|r for i = 1,2,...,k. 
Since m is the smallest positive integer with this property, what can you 
conclude about r?] 


*Induction is discussed in Appendix C. 
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32. Let a and b be integers, not both 0, and let ¢ be a positive integer. Prove that ¢ is 
the least common multiple of a and b if and only if t satisfies these conditions: 


(i) aļt and b|ż; 
(ii) If ajc and bļe, then tfc. 


C. 33. If a >O0and b > 0, prove that [a, b] = [a, b] is defined in Exercise 31.) 


ab 
Teby 
34. Prove that 
(a) (a, b)|(a + b,a — b); 
(b) if ais odd and b is even, then (a, b) = (a + b, a — b); 
(c) if a and b are odd, then 2(a, b) = (a + b, a — b). 


mE Primes and Unique Factorization 


Every nonzero integer n except +1 has at least four distinct divisors, namely 1, ~1, n, —n. 
Integers that have only these four divisors play a crucial role, 


Definition An integer p is said to be prime if p # 0, +1 and the only divisors of p are 


+1 and +p. 


EXAMPLE 1 


3, —5, 7, —11, 13, and —17 are prime, but 15 is not (because 15 has divisors 
other than +1 and +15, such as 3 and 5). The integer 4567 is prime, but prov- 
ing this fact from the definition requires a tedious check of all its possible divi- 
sors. Fortunately, there are more efficient methods for determining whether an 
integer is prime, one of which is discussed at the end of this section. 


It is not difficult to show that there are infinitely many distinct primes (Exercise 32). 
Because an integer p has the same divisors as —p, we see that 


p is prime if and only if —p is prime. 


If p and gare both prime and p |¢, then p must be one of 1, —1, g, —q. But since p is 
prime, p # +1. Hence, 


if p and q are prime and p|q, then p = q. 


Under what conditions does a divisor of a product be necessarily divide b or c? 
Theorem 1.4 gave one answer to this question. Here is another. 
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Theorem 1.5 
Let p be an integer with p + 0, +1. Then p is prime if and only if p has this 
property: 


whenever p | bc, then p |b or p|c. 
Proof» Since this is an “if and only if” statement, there are two parts to the proof. 


Step 1 Assume that p is prime and prove that p has the property stated in the theorem. 


Proof of Step 1: If pis prime and divides be, consider the god of p and b. 
Now (p, b) must be a positive divisor of the prime p. So the only possibilities 
are (p, b) = 1 and (p, b) = +p (whichever is positive). If (p, b) = +p, then 

p |b. If (p, b) = 1, since p|bc, we must have p|c by Theorem 1.4. In every 
case, therefore, p |b or p |c. Hence, p has the property stated in the theorem. 


Step 2 Assume that p is an integer that has the property stated in the theorem and 
prove that p is prime. 


Proof of Step 2: This proof is left to the reader (Exercise 14). E 


Corollary 1.6 


If p is prime and p|a,a,: - * ân, then p divides at least one of the a, 


Proof» If p |a, (aa; - + a,), then p |a; or p | aza; ` «+ a, by Theorem 1.5. If p |a, 
we are finished. If p |a, (aa; - ++ 4,), then p |a or p|a,a,--- a, by 
Theorem 1.5 again. If p |a, we are finished; if not, continue this process, 
using Theorem 1.5 repeatedly. After at most n steps, there must be an a, 
that is divisible by p. E 


Choose an integer other than 0, +1. If you factor it “as much as possible,” you will 
find that it is a product of one or more primes. For example, 


12=4-3=2:2:-3, 
60 = 12°5=2-2+3-5, 
113 = 113 (prime). 


In this context, we allow the possibility of a “product” with fust one factor in case the number 
we begin with is actually a prime. What was done in these examples can always be done: 


Theorem 1.7 


Every integer n except 0, +1 is a product of primes. 
Proof» First note that if n is a product of primes, say n = P p2 -* + Prs then —n = 


(—p,)p2 ° + * Pk is also a product of primes. Consequently, we need prove 
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the theorem only when » > 1. The idea of the proof can be summarized 
like this: 


Let S be the set of all integers greater than 1 that are not a product of 
primes. Show that S is the empty set. Then, since there are no integers 
in S, it must be the case that every integer greater than 1 is a product of 
primes (otherwise, it would be in S). 


Proof that S is empty: The proof is by contradiction: We assume that S is 
not empty and use that assumption to reach a contradiction. So assume that 
S is not empty. Then S contains a smallest integer m by the Well-Ordering 
Axiom. Since m E S, mis not itself prime. Hence m must have positive divi- 
sors other than 1 or m, say m = ab with 1 < a < mand 1 <b < m. Since 
both a and b are less than m (the smallest element of S ), neither a nor b is in 
S. By the definition of S, both aand b are the product of primes, say 


a=ppP: "P, and b=Giqn-+* Gy, 


with r = 1, s = 1, and each p,, g prime. Therefore 


m = ab = pP- >’ Pagg 


is a product of primes, so that m ¢ S. We have reached a contradiction: 
m E Sand m ¢# S. Therefore, S must be empty. @ 


Technology Tip: To find the prime factorization of integers as large as 10—12 dig- 
its on a TI graphing calculator, download the FACTOR program on our website 
(www. CengageBrain.com). The program uses Theorem 1.10, which is proved on 
page 21, to do the factorization. Maple and Mathematica can find the prime fac- 
torization of these and much larger integers very quickly. 


An integer other than 0, +1 that is not prime is called composite. Although a com- 
posite integer may have several different prime factorizations, such as 


45=3:3-5, 
45 = (-3) 5- (-3), 
45=5:3:3, 


45 = (-5): (3) 3, 


these factorizations are essentially the same. The only differences are the order of the 
factors and the insertion of minus signs. You can readily convince yourself that every 
prime factorization of 45 has exactly three prime factors, say ¢,q2q3. Furthermore, 
by rearranging and relabeling the g's, you will always have 3 = +q,, 3 = +q, and 
5 = +q. This is an example of the following theorem. 
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Theorem 1.8 The Fundamental Theorem of Arithmetic 


Every integer n except 0, +1 is a product of primes. This prime factorization 
is unique in the following sense: If 


N = PPa: P, and = = Gido++- Gs 


with each p; q; prime, then r = s (that is, the number of factors is the same) 
and after reordering and relabeling the g's, 


Pi = Èq Po = +4 Pa = Eqa +o i Pe = Egh 


Proof» Every integer n except 0, +1 has at least one prime factorization by 
Theorem 1.7. Suppose that # has two prime factorizations, as listed in 
the statement of the theorem. Then 


PIPP; - + + Pr) = 919293 + * * Fs» 


so that p,|q192- - - q,- By Corollary 1.6, pı must divide one of the q;. By 
reordering and relabeling the q’s if necessary, we may assume that p; |q}. 
Since p, and q, are prime, we must have p; = +q,. Consequently, 


£91 P2P3° + * Pr = 919293 ++ * Gee 
Dividing both sides by q, shows that 


PA*pspa- + + Pr) = 929394°** Ges 


so that p.| 9293 * * * qs- By Corollary 1.6, p must divide one of the q,; as 
before, we may assume pz |q2. Hence, pp = +q, and 


£42P3P4°** Pr = 929394 °° * Ie 
Dividing both sides by g, shows that 


P +p, Fox -P,) = 4394 °** Qe 


We continue in this manner, repeatedly using Corollary 1.6 and elimi- 
nating one prime on each side at every step. If r = s, then this process 
leads to the desired conclusion: p) = £q}, P} = +q,.-- , P, = +q, SO 
to complete the proof of the theorem, we must show that r = s. The 
proof that r = s is a proof by contradiction: We assume that r + s$ 
(which means that r > s or that r < s), and show that this assumption 
leads to a contradiction. 

First, suppose that r > s. Then after s steps of the preceding process, all 
the q’s will have been eliminated and the equation will read 


Pii Pitz Pr = L 
This equation says (among other things) that p, | 1. Since the only divi- 


sors of 1 are +1, we have p, = +1. However, since p, is prime, we know 
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that p,#+1 by the definition of “prime”. We have reached a contradic- 

tion (p, = +1 and p, # +1). So r > s cannot occur. A similar argument 
shows that the assumption r < s also leads to a contraction and, hence, 

cannot occur. Therefore, r = s is the only possibility, and the theorem is 
proved. W 


Technology Tip: The FACTOR program for TI calculators on our website 
(www.CengageBrain.com) factors an integer n as a product of primes relatively 
quickly. For example, if n = 94,017, then n = 3 - 7 - 11?» 37, as shown in Figure 1. 


N=?94617 


Bee, 


FIGURE 1 


On Maple, the command ifactor(n); will produce the prime factorization of n. 


If consideration is restricted to positive integers, then there is a stronger version of 
unique factorization: 


Corollary 1.9 


Every integer n > 1 can be written in one and only one way in the form 
Nn = PiPoPa° + > Pn where the p; are positive primes such that p; S po = 


Pg SSP, 


Proof» Exercise 12 m 


Primality Testing 


In theory it is easy to determine if a positive integer n is prime. Just divide n by every 
integer between 1 and x to see if n has a factor other than 1 or n. Actually, you need only 
check prime divisors because any factor of n (except 1) is divisible by at least one prime. 
The following primality test greatly reduces the number of divisions that are necessary. 


Theorem 1.10 


Let n > 1, If n has no positive prime factor less than or equal to Vn, then n 
is prime. 


Before proving this theorem, it may be helpful to see how it is used. 
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EXAMPLE 2 


To prove that 137 is prime, the theorem says that we must verify that 137 has no 
positive prime factors less than or equal to V137 = 11.7; that is, we need only 
show that 2, 3, 5,7, and 11 are not factors of 137. You can easily verify that 
none of them divide 137. Hence, 137 is prime by Theorem 1.10. 


The proof of Theorem 1.10 (like several earlier in this chapter) is somewhat more 
detailed than is necessary. In particular, the underlined parts of the proof are normally 
omitted. 


Proof of Theorem 1.10> The proof is by contradiction. Suppose that n is not 
prime. Then n has at least two positive prime factors, say pı and pz, 
so that n = p,p.k for some positive integer k. By hypothesis, n has no_ 


positive prime divisors less than or equal to Vn. Hence, p, > Vn and 
P2 > Vn. Therefore, 


n= p\p,k = pp, > Vavn =n, 


which says that n > n, a contradiction. Since the assumption that n is not 
prime has le a tradiction, we conclude that n is prime. E 


Theorem 1.10 is useful when working by hand with relatively small numbers. 
Testing very large integers for primality, however, requires a computer and techniques 
that are beyond the scope of this book. 


E Exercises 


A. 1. Express each number as a product of primes: 

(a) 5040 (b) —2345 
(c) 45,670 (d) 2,042,040 

2. (a) Verify that 25 — 1 and 27 — 1 are prime. 
(b) Show that 2!! — 1 is not prime. 

3. Which of the following numbers are prime: 
(a) 701 (b) 1009 
(c) 1949 (d) 1951 


4. Primes p and g are said to be twin primes if q = p + 2. For example, 3 and 5 are 
twin primes; so are 11 and 13. Find all pairs of positive twin primes less than 200. 


5. (a) List all the positive integer divisors of 35‘, where s, £ € Z and s, ¢ > 0. 
(b) If r,s, t © Z are positive, how many positive divisors does 2°375' have? 
6. If p> 5 is prime and p is divided by 10, show that the remainder is 1, 3, 7, or 9. 
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7. If a, b, c are integers and p is a prime that divides both a and a + bc, prove 
that p |b or p|c. 


8. (a) Verify that x — | is a factor of x" — 1. 


(b) If n is a positive integer, prove that the prime factorization of 2” - 3” — 1 
includes 11 as one of the prime factors. [Hint: (2"" - 3") = (22 - 3] 


9. Let p be an integer other than 0, +1. Prove that p is prime if and only if it 
has this property: Whenever r and s are integers such that p = rs, then r = 
+lors = +1. 


10. Let p be an integer other than 0, +1. Prove that p is prime if and only if for 
each a E Z either (a, p) = | or p|a. 


11. If a, b, c, dare integers and p is a prime factor of botha — band c — d, prove 
that p isa prime factor of (a + c) — (b + d). 


12. Prove Corollary 1.9. 


13. Prove that every integer n > | can be written in the form pi’ pz : + « př, with the 
p; distinct positive primes and every r; > 0. 


14. Let p be an integer other than 0, +1 with this property: Whenever b and c 
are integers such that p | bc, then p |b or p |c. Prove that p is prime. 
[Hint: If dis a divisor of p, say p = dt, then p |d or p | t. Show that this 
implies d= tpord= +1.] 


15. If p is prime and p |a", is it true that p" |a"? Justify your answer. 
[Hint: Corollary 1.6.] 


16. Prove that (a, b) = 1 if and only if there is no prime p such that p |a and p |b. 
17. If p is prime and (a, b) = p, then (a’, P) = ? 
18. Prove or disprove each of the following statements: 

(a) If p is prime and p|(a@ + $) and p|(c? + £), then p| (a@’ — c’). 

(b) If p is prime and p|(a? + b?) and p|(c? + £), thenp|(@ + c’). 

(c) If p is prime and p|a and p|(a* + 2), then p|b. 


B.19. Suppose that a = pi'p?---p%and b = pi'p#--- pe, where p, pr, ..., Py are 
distinct positive primes and each r, sı = 0. Prove that a | b if and only if 
r; S s; for every i. 


20. If a = piipyp}--- pē and b = pi p?p}. +- pe, where py, Pa, ..., py are distinct 
positive primes and each r, s; = 0, then prove that 


(a) (a, b) = pip p* . . - pe, where for each i, n, = minimum of Fr, Si 


(b) [a, b] = pi p3p} -- + pz, where t, = maximum of r, s,. [See Exercise 31 in 
Section 1.2.] 


21. If ¢ = ab and (a, b) = 1, prove that a and b are perfect squares 


22. Let n = př p? -- - pē, where pj, Po, . . ., Py are distinct primes and each r; = 0. 
Prove that n is a perfect square if and only if each z; is even. 


23. Prove that a| b if and only if a| 5”. [Hint: Exercise 19.] 
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24. Prove that a| b if and only if a" | b". 

25. Let p be prime and 1 = k < p. Prove that p divides the binomial coefficient fa 
DN 

[recan that e) Io- | 


26. If n is a positive integer, prove that there exist n consecutive composite 
integers. [Hint: Consider (n + 1)! + 2, (n + 1)! +3, (n +1) +4,....] 


27. If p > 3 is prime, prove that p* + 2 is composite. [Hint: Consider the possible 
remainders when p is divided by 3.] 


28. Prove or disprove: The sums 
1+2+4, 1+2+4+8, 1+2+4+8+16,... 
are alternately prime and composite. 


29. If n € Z andn + 0, prove that n can be written uniquely in the form n = 2*m, 
where k = 0 and m is odd. 


30. (a) Prove that there are no nonzero integers a, b such that a* = 257. 
[Hint: Use the Fundamental Theorem of Arithmetic.] 
(b) Prove that ‘V2 is irrational. [Hint: Use proof by contradiction (Appendix A). 
Assume that V2 = a/b (with a,b € Z) and use part (a) to reach a contradiction.] 


31. If p is a positive prime, prove that vp is irrational. [See Exercise 30.] 


32. (Euclid) Prove that there are infinitely many primes. [Hint: Use proof by 
contradiction (Appendix A). Assume there are only finitely many primes 
Pis Pas «+ - » Py, and reach a contradiction by showing that the number 
PiP2--* Py + 1 is not divisible by any of Pi, Pz, - - - » Pe-] 


33. Let p> 1. If 2? — 1 is prime, prove that p is prime. [Hint: Prove the 
contrapositive: If p is composite, so is 2? — 1.] 
Note: The converse is false by Exercise 2(b). 


C.34. Prove or disprove: If n is an integer and n > 2, then there exists a prime p such 
that n <p <al. 


35. (a) Let a bea positive integer. If Va is rational, prove that Va is an integer. 


(b) Let r be a rational number and a an integer such that 7” = a. Prove that r 
is an integer. [Part (a) is the case when n = 2.] 


36. Let p, q be primes with p = 5, q = 5. Prove that 24|(p? — q’). 
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CHAPTER 2 


Congruence in Z and Modular Arithmetic 


Basic concepts of integer arithmetic are extended here to include the idea of 
“congruence modulo n.” Congruence leads to the construction of the set Z, of all 
congruence classes of integers modulo n. This construction will serve as a model 
for many similar constructions in the rest of this book. It also provides our first 
example of a system of arithmetic that shares many fundamental properties with 
ordinary arithmetic and yet differs significantly from it. 


ey Congruence and Congruence Classes 


The concept of “congruence” may be thought of as a generalization of the equality 
relation. Two integers a and b are equal if their difference is 0 or, equivalently, if their 
difference is a multiple of 0. If n is a positive integer, we say that two integers are con- 
gruent modulo n if their difference is a multiple of n. To say that a — b = nk for some 
integer k means that n divides a — b. So we have this formal definition: 
Definition Let a, b, n be integers with n > 0. Then a is congruent to b modulo n 
[written “a = b (mod n)"), provided that n divides a — b. 


EXAMPLE 1 


17 = 5 (mod 6) because 6 divides 17 — 5 = 12. Similarly, 4 = 25 (mod 7) 
because 7 divides 4 — 25 = —21, and 6 = —4(mod 5) because 5 divides 
6 —-(—4) = 10. 


Remark In the notation “a = b (mod n),” the symbols “=” and “(mod n)” 
are really parts of a single symbol; “a = b” by itself is meaningless. Some texts write 
“a =, b” instead of “a = b (mod n).” Although this single-symbol notation is advanta- 
geous, we shall stick with the traditional “(mod x)” notation here. 
26 
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The symbol used to denote congruence looks very much like an equal sign. This is 
no accident since the relation of congruence has many of the same properties as the 
relation of equality. For example, we know that equality is 

reflexive: a = a for every integer a; 

symmetric: if a = b, then b = a; 

transitive: if a = band b = c, thena = c. 


We now see that congruence modulo 7 is also reflexive, symmetric, and transitive. 


Theorem 2.1 
Let n be a positive integer. For all a, b, cE Z, 
(1) a =a (mod n); 
(2) if a = b (mod n), then b = a (mod n); 
(3) if a = b (mod n) and b = c (mod n), then a = c (mod n). 


Proof » (1) To prove that a = a (mod n), we must show that » | (a — a). But 
a — a = 0 and n | 0 (see Example 2 on page 9). Hence, n | (a — a) and 
a = a (mod n). 


(2) a =b (mod n) means that a — b = nk for some integer k. Therefore, 
b — a = —(a — b) = —nk = n(—k). The first and last parts of this 
equation say that n | (b —-a). Hence, b = a (mod n). 


(3) If a = b (mod n) and b = c (mod n), then by the definition of 
congruence, there are integers k and ¢ such that a — b = nk and 
b — c = nt. Therefore, 


(a — b)+ (b — c) = nk + nt 
a—c=nk+®À). 


Thus z | (a — c) and, hence, a = c (mod n). E 
Several essential arithmetic and algebraic manipulations depend on this key fact: 
Ifa = bande = d, thena + c = b + dand ac = bd. 


We now show that the same thing is true for congruence. 


Theorem 2.2 

If a =b (mod n) and c = d (mod n), then 
(i) a+c=b+d (mod n); 
(2) ac = bd (mod n}. 
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Proof > (1) To prove that a + c = b + d (mod n), we must show that n divides 
(a + c) — (b + d). Since a = b (mod n) and c = d (mod n), we know that 
n|(a— b)andn|(b — d). Hence, there are integers k and t such that 


(*) a-b=nk and c-—d=nt. 


We use these facts to show that n divides (a + c) — (b + d): 


(a+c)—-(b+ a) =a+c-b-d [Arithmetic] 
=(a-—b)+(e-—4) [Rearrange terms.] 
=nk+nt [a — b = nkandc — d = nt] 

(a + c) — (b + d) = nfk +d [Factor right side] 


The last equation says that n divides (a + c) — (b + d). Hence, a + c = 
b + d (mod n). 


(2) We must prove that n divides ac — bd.* 
ac — bd = ac + 0 — bd 
=ac-bc+ bc—bd [—be +bc= 0] 
= (a — bje + b(c — d) [Factor first two terms and last two terms. 
= (nk)c + b(nt) [a — b = nk and c — d = nt by (*) above] 
ac — bd = n(ke + bt) [Factor n from each term.] 


The last equation says that n | (ac — bd). Therefore, ac = bd (mod n). W 


With the equality relation, it’s easy to see what numbers are equal to a given 
number a—just a itself. With congruence, however, the story is different and leads to 
some interesting consequences. 


Definition Let a and n be integers with n > 0. The congruence class of a modulo n 


(denoted [a]) is the set of all those integers that are congruent to a modulo 
n, that is, 


[a] = {bjbeZ and b=a(mod n)}. 


To say that b = a (mod n) means that b — a = kn for some integer k or, equivalently, 
that b = a + kn. Thus 


[a] = {b |b =a (mod n)} = {[b|b = a + kn with ke Z} 
= {a +kn| keg}. 


#The first two lines of this proof are a standard algebraic technique: Rewrite 0 in the form —X + X 
for a suitable expression X. 
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EXAMPLE 2 


In congruence modulo 5, we have 
[9] = {9 + 5k| keZ} = {9,9 + 5,9 + 10,9 + 15,...} 
= {...,—-11, —6, —1, 4, 9, 14, 19, 24, ...}, 
EXAMPLE 3 


The meaning of the symbol “[ ]” depends on the context. In congruence 
modulo 3, for instance, 


[2] = {2+ 3k | kKeEZ} = {..., —7, —4, —1, 2, 5, 8.3, 
but in oongruence modulo 5 the congruence class [2] is the set 


{2 + 5k|KEZ} = {..., -13, -8, -3, 2, 7, 12,...}. 


This ambiguity will not cause any difficulty when only one modulus is 
under discussion. On the few occasions when several moduli are discussed 
simultaneously, we avoid confusion by denoting the congruence class of a 
modulo x by [a]. 


EXAMPLE 4 


In congruence modulo 3, the congruence class 
(2) Si evens —Uy 4, 253s. E A 
Notice, however, that [— 1] 1s the same class because 
[-1] = {-1 + 3k |keZ} = {..., —7, —4, —1, 2, 5,...}. 


Furthermore, 2 = —1 (mod 3). This is an example of the following theorem. 


Theorem 2.3 


a =c (med n) if and only if [a] = [c]. 


Since Theorem 2.3 is an “if and only if” statement, we must prove two different 
things: 

1. If a=c (mod n), then [a] = [c]. 

2. If [a] = [e], then a = c (mod n). 


Neither of these proofs will use the definition of congruence. Instead, the proofs will 
use only the fact that congruence is reflexive, symmetric, and transitive (Theorem 2.1). 
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Proof of Theorem 2.3 » First, assume that a = c (mod n). To prove that [a] = [c], we first 
show that [a] [[c]. To do this, let bE [a]. Then by definition b= a(mod n). Since 
a = e (mod n), we have b = c (mod n) by transitivity. Therefore, be [c] and 
[a] & [c]. Reversing the roles of a and c in this argument and using the fact that 
c =a by symmetry, show that [c] S [a]. Therefore, [a] = [q]. 

Conversely, assume that [a] = [c]. Since a = a (mod n) by reflexivity, 

we have a E [a] and, hence, a e[c]. By the definition of [c], we see that 
a=c(modn). E 


If A and C are two sets, there are usually three possibilities: Either A and C are dis- 
joint, or A = C, or A N C is nonempty but A +C. With congruence classes, however, 
there are only two possibilities: 


Corollary 2.4 


Two congruence classes modulo n are either disjoint or identical. 


Proof» if [a] and [c] are disjoint, there is nothing to prove. Suppose that 
[a] N [e] is nonempty. Then there is an integer b with bE [a] and b €[c]. 
By the definition of congruence class, b = a (mod n) and b = c (mod n). 
Therefore, by symmetry and transitivity, a = c (mod n). Hence, [a] = [c] 
by Theorem 2.3. E 


Corollary 2.5 


Let n > 1 be an integer and consider congruence modulo n. 


(1) Hf ais any integer and r is the remainder when a is divided by n, then 
[a] = [r]. 

(2) There are exactly n distinct congruences classes, namely, [0], [1], 
[2], ue moe 1]. 


Proof» (1) Let a EZ. By the Division Algorithm, a = ng + r, with 0 =r < n. 
Thus a — r = gn, so that a = r (mod an). By Theorem 2.3, [a] = [r]. 


(2) If [a] is any congruence class, then (1) shows that [a] = [r] with 

0 =r <n. Hence, [a] must be one of [0], [1], [2], ..., [” — 1]. 
Tocomplete the proof, we must show that these n classes are all distinct. 

To do this, we first show that no two of 0, 1, 2,...,”— 1 are congruent 

modulo n. Suppose that s and ¢ are distinct integers in the list 0,1,2,..., 

n — 1. Then one is larger than the other, say t, so that 0 = s < t < n. 

Consequently, t — s is a positive integer that is less than n. Hence, n does 

not divide ¢ — s, which means that ż # s. Thus, no two of 0, 1, 2,..., 

n — 1 are congruent modulo n. Therefore, by Theorem 2.3, the classes [0], 

(1), [2] . . -, [2 — 1] are all distinct. m 
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Definition The set of all congruence classes modulo n is denoted Z, (which is read 


“Z mod n"). 


There are several points to be careful about here. The elements of Z, are classes, 
not single integers. So the statement [5] EZ, is true, but the statement 5€ Z, is not. 
Furthermore, every element of Z, can be denoted in many different ways. For example, 
we know that 


2 = 5 (mod 3) 2 = —1 (mod 3) 2 = 14 (mod 3). 
Therefore, by Theorem 2.3, [2] = [5] = [—1] = [14] in #3. Even though each element 
of Z,, (that is, each congruence class) has infinitely many different labels, there are only 
finitely many distinct classes by Corollary 2.5, which says in effect that 


The set Z, has exactly z elements. 


For example, the set Z3 consists of the three elements [0], [1], [2]. 


E Exercises 


A. 1. Show that a?—! = 1 (mod p) for the given p and a: 
(a) a=2,p=5 (b) a=4,p=7 (c) a=3,p=11 
2. (a) If k = 1 (mod 4), then what is 6k + 5 congruent to modulo 4? 


(b) If r = 3 (mod 10) and s = —7 (mod 10), then what is 27 + 3s congruent to 
modulo 10? 


3. Every published book has a ten-digit ISBN-10 number (on the back cover 
or the copyright page) that is usually of the form x|—x2x3X4—XsX6X7X8Xo-X10 
(where each x; is a single digit).* The first 9 digits identify the book. The last 
digit x49 is a check digit; it is chosen so that 


10x, + 9x3 + 8x3 + 7X4 + 6x5 + 5x + 4X + 3x + 2x9 + xı = 0 (mod 11). 


If an error is made when scanning or keying an ISBN number into a computer, 
the left side of the congruence will not be congruent to 0 modulo 11, and the 
number will be rejected as invalid.? Which of the following are apparently valid 
ISBN numbers? 


(a) 3-540-90518-9 (b) 0-031-10559-5 (c) 0-385-49596-X 


*Sometimes the last digit of an ISBN number is the letter X. In such cases, treat X as if it were the 
number 10. 


tThe procedures in Exercises 3 and 4 will detect every single digit substitution error (for instance, 
3is entered as 8 and no other error is made). They will detect about 90% of transposition errors (for 
instance, 74 is entered as 47 and no other error is made). However, they may not detect multiple errors. 


(Capyrighs 2012 Camgage Leaning Alt Qighis Ramrvat Alay not be copied, amal oc dupficatart, in whole or ia part Dus to elecirunic cights, mama third party contest may be apprmd firan the a Book ador e haptarfa). Edixriai ewview baa 
Gece d that eny appi costed dows not matacelty sfibci the oven emning apaiara Caga Learning asv the right w seme effitional naam at ay ns if suhega tipths cemrictirms require it. 


2.1 Congruence and Congruence Classes 31 


4. Virtually every item sold in a store has a 12-digit UPC barcode which is scanned 
at the checkout counter. The first 11 digits of a UPC number d,d,d,- + - + dirdi 
identify the manufacturer and product. The last digit dj, is a check digit which 
is chosen so that 


3d, + d; + 3d; + dy + 3d; + dg + 3d, + dg + 3dy + dio + 3d; + dy = 0 (mod 10). 


If the congruence does not hold, an error has been made and the item must 
be scanned again, or the UPC code entered by hand. Which of the following 
UPC numbers were scanned incorrectly? 


(a) 037000356691 (b) 833732000625 (c) 040293673034 
5. (a) Which of [0], [1], [2], [3] is equal to [5°™] in Z,? [Hint: 5 = 1 (mod 4); use 
Theorems 2.2 and 2.3.] 


(b) Which of [0], [1], [2], [3], [4] is equal to [4] in Z? 
. If a = b (mod n) and k | n, is it true that a = b (mod k)? Justify your answer. 
. If ae Z prove that a” is not congruent to 2 modulo 4 or to 3 modulo 4. 
. Prove that every odd integer is congruent to 1 modulo 4 or to 3 modulo 4. 
. Prove that 
(a) (n — af = a (mod n) (b) (2n — a)? = a? (mod 4n) 


10. If a is a nonnegative integer, prove that a is congruent to its last digit mod 10 
[for example, 27 = 7 (mod 10)]. 


B.11. If a, b are integers such that a = b (mod p) for every positive prime p, prove 
that a = b. 


12. If p = 5 and pis prime, prove that [p] = [1] or [p] = [5] in %. 
[Hint: Theorem 2.3 and Corollary 2.5.] 


13. Prove that a = b (mod n) if and only if a and b leave the same remainder when 
divided by n. 


14. (a) Prove or disprove: If ab = 0 (mod n), then a = 0 (mod n) or b = 0 (mod n). 
(b) Do part (a) when n is prime. 


O eon A 


15. If (@, n) = 1, prove that there is an integer b such that ab = 1 (mod n). 


16. If [a] = [1] in Z,, prove that (a, n) = 1. Show by example that the converse 
may be false. 


17. Prove that 10" = (—1}" (mod 11) for every positive n. 


18. Use congruences (not a calculator) to show that 
(125698) (23797) # 2891235306. [Hint: See Exercise 21.] 


19. Prove or disprove: If [a] = [b] in Z,, then (a, n) = (b, n). 


20. (a) Prove or disprove: If a? = b? (mod a), then a = b (mod n) or 
a = —b (mod n). 


(b) Do part (a) when n is prime. 
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21. (a) Show that 10" = 1 (mod 9) for every positive n. 


(b) Prove that every positive integer is congruent to the sum of its digits mod 
9 [for example, 38 = 11 (mod 9)]. 


22. (a) Give an example to show that the following statement is false: If ab = ac 
(mod n) and a ¥ 0 (mod n), then b = c (mod n). 


(b) Prove that the statement in part (a) is true whenever (a, n) = 1. 


EXCURSION: The Chinese Remainder Theorem (Section 14.1) may be 


covered at this point if desired. 


Ea Modular Arithmetic 


The finite set Z, is closely related to the infinite set Z. So it is natural to ask if it is 
possible to define addition and multiplication in Z, and do some reasonable kind of 
arithmetic there. To define addition in Z„ we must have some way of taking two classes 
in Z, and producing another class—their sum. Because addition of integers is defined, 
the following tentative definition seems worth investigating: 

The sum of the classes [a] and [c] is the class containing a + c or, in symbols, 


[a] © [e] = [a + d, 


where addition of classes is denoted by @ to distinguish it from ordinary addition of 
integers. 
We can try a similar tentative definition for multiplication: 


The product of [a] and [c] is the class containing ac: 
[a] © [q] = [ae], 
where © denotes multiplication of classes. 


EXAMPLE 1 


In Z we have [3] ® [4] = [3 + 4] = [7] = [2] and [3] © [2] = [3 + 2] = [6] = [1]. 


Everything seems to work so far, but there is a possible difficulty. Every element of 
Z, can be written in many different ways. In Z, for instance, [3] = [13] and [4] = [9]. In 
the preceding example, we saw that [3] ® [4] = [2] in Z4. Do we get the same answer if 
we use [13] in place of [3] and [9] in place of [4]? In this case the answer is “yes” because 


[13] ®© [9] = [13 + 9] = [22] = [2]. 


But how do we know that the answer will be the same no matter which way we write 
the classes? 
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To get some idea of the kind of thing that might go wrong, consider these five 
classes of integers: 


’ —14, —8, =2 0, 6, 12, 18, we 3 
ry ts —7, -3, 1, 5; 9, 13, arà 3 


..., l= Or 2, 8, 14, 20, . . .} 


m S&S QA & & 
1 


{ 
{ 
fg Dy Sy aT Seadh 
{ 
{ 


os, = 15,6) 4, 10, 16, 22, .. } 


These classes, like the classes in Z, have the following basic properties: Every integer 
is in one of them, and any two of them are either disjoint or identical. Since 1 is in B 
and 7 is in C, we could define B + C as the class containing 1 + 7 = 8, that is, B + C= 
D. But B is also the class containing —3 and C the class containing 15, and so B + C 
ought to be the class containing —3 + 15 = 12. But 12 isin A, so that B + C = A. Thus 
you get different answers, depending on which “representatives” you choose from the 
classes B and C. Obviously you can’t have any meaningful concept of addition if the 
answer is one thing this time and something else another time. 

In order to remove the word “tentative” from our definition of addition and mul- 
tiplication in Z,, we must first prove that these operations do not depend on the 
choice of representatives from the various classes. Here is what’s needed: 


Theorem 2.6 


If [a] = [b] and [c] = [d in Za, then 
[a+c]=[b+d] and [ac] = [bd]. 
Proof > since [a] = [b], we know that a = b (mod n) by Theorem 2.3. Similarly, 
[c] = [d] implies that c = d (mod n). Therefore, by Theorem 2.2, 
a+c=b+d(modz) and ac=bd(modn). 
Hence, by Theorem 2.3 again, 
[a +c] = [b + d] and [ac] = [bd]. E 


Because of Theorem 2.6, we know that the following formal definition of addition 
and multiplication of classes is independent of the choice of representatives from each 
class: 


Definition Addition and multiplication in Z, are defined by 
[a] [c]=[a+e] and [a]6 [c] = [ec]. 
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EXAMPLE 2 


Here are the complete addition and multiplication tables for Z, (verify that 
these calculations are correct):* 


® |o 2 B A ojo mM B B A 
Oo; © B BA [0] | 0] [0] =f [0] [O 
MM} B B) [4 [0] [0 | 0O 1) 2 B) A 
2/2 8 A A [i [2] | [0] 2) 4 Tf [3] 
B] | BI 4 O O] [2 [B] 0 8 O] A [2 
4 | 4 © D 2 BB 4 | 0O 4 B B2 H 


And here are the tables for Zg: 


®© | (0) Ho B B) A [S 
[0] | [0] OOo 2 8 ë A4 B 
O) | 2 B A 5 A 
2] | B] 3 A 5S O 
B] | 5B] 4] [5] 0] 8 B 
[4] | [4] 5 O N BA B 
[5] | [5 0 G 2 B) A 


o | 0 N BA B 4 B 
oOo; o Oo © A fF A 
WY) Oo © A BB FA S 
2); A A oO a 4 
[B] | Oo BI © BF OM Bf) 
4 oo a A oO A B 
|} oOo S A B ë BA WW 


Properties of Modular Arithmetic 


Now that addition and multiplication are defined in Z,,we want to compare the properties 
of these “miniature arithmetics” with the well-known properties of Z The key facts about 
arithmetic in Z (and the usual titles for these properties) are as follows. For all a, b, cE Z: 


1. If a, bEZ, thena +bEZ [Closure for addition] 
2. a+ (b+ c)=(a+b)+c. [Associative addition] 
3.a+b=b+a. [Commutative addition] 
4.a+0=a=0+a. [Additive identity] 


*These tables are read like this: If [2] appears in the left-hand vertical column and [c] in the top 
horizontal row of the addition table, for example, then the sum [a] @ [c] appears at the intersection 
of the horizontal row containing [a] and the vertical column containing [c]. 
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5. For each ac Z, the equation 
a+ x = 0 has a solution in Z. 


6. If a, bEZ, then abez. [Closure for multiplication] 
7. abc) = (ab)c [Associative multiplication] 
8. alb + c) = ab + ac and 
(a + bje = ac + be. [Distributive laws] 
9. ab = ba [Commutative multiplication] 
10.a-l=a=l:a [Multiplicative identity] 


11. If ab = 0, then a = 0 or b = 0. 


By using the tables in the preceding example, you can verify that the first ten of 
these properties hold in Z; and Z6 and that Property 11 holds in Z, and fails in 
#,. But using tables is not a very efficient method of proof (especially for verify- 
ing associativity or distributivity). So the proof that Properties 1—10 hold for 
any Ž,„ is based on the definition of the operations in Z, and on the fact that 
these properties are known to be valid in Z. 


Theorem 2,7 


For any classes [a], [b], [c] in Z, 


. If [a] eZ, and [b] € Z, then [a] @ [b] € Z, 

. [a] © ([b] © (cl) = ([a] © (6) @ [c]. 

- [a] © [b] = [b] @ [a]. 

- [a] © [0] = [a] = [0] © [a]. 

. For each [a] in Z,, the equation [a] ® X = [0] has a solution in Z,. 

. If [a] EZ, and [b]E Z,, then [a] © [b]E 2. 

. [a] © ([b] © [c]) = ([a] © [b]) © [c]. 

. [a] © ((6] @ [c}) = [a] © [b] @ [a] © [c] and 
((a] © [6]) © [c] = [a] © [c] © [b] © [c]. 

9, [a] © [b] = [b] © [a]. 

10. [a] O [1] = [a] = [1] © [a]. 


SN gagra UNa‘ 


Proof » Properties 1 and 6 are an immediate consequence of the definition of ® 
and © in Z, 


To prove Property 2, note that by the definition of addition, 
[a] © (H @ [d) = fel O [b + d = [a + (b + c). 


In Z we know that a + (b + c) = (a + b) + c. So the classes of these 
integers must be the same in Z,; that is, [a + (b +c] = [(a + b) + c]. By 
the definition of addition in Z,, we have 


[a + b) +c] = [a + b) Ọ [d = (al © BD Old. 
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This proves Property 2. The proofs of Properties 3, 7, 8, and 9 are 
analogous (Exercise 10). 
Properties 4 and 10 are proved by a direct calculation; for instance, 
[a] © [1] = [a- 1] = [a]. 
For Property 5, it is easy to see that Y = [—a] is a solution of the 
equation since [a] ® [—a] = [a + (-a)] = [0]. Em 
Exponents and Equations 
The same exponent notation used in ordinary arithmetic is also used in Z}. If [a] EZ, 
and k is a positive integer, then [a]* denotes the product 
[a] © [a] © [a] O - - -O [a] (k factors). 


EXAMPLE 3 
In Zs, [3] = [3] © [B] = [4] and [3]* = [3] © [3] © [3] © [3] = [1]. 
As noted on page 9, the set Z, has exactly n elements. Consequently, any equation 


in Z,, can be solved by substituting each of these » elements in the equation to see 
which ones are solutions. 


EXAMPLE 4 


To solve x? @ [5] © x = [0] in Zs, substitute each of [0], [1], [2], [3], [4], and [5] 
in the equation to see if it is a solution: 


x xr@O[5s]}Ox Is x7 ® [5] © x = [0]? 
[0] [0O & [S]O[O}] = [0] & [0] = [0] Yes; solution 

1 HON e SOl] = [1] © [5] = [0] Yes; solution 

2} 20 B SER = [4] HA = 2] No 

[3] BIOR] © [SJO[3] = [3] © [3] = [0] Yes; solution 

[4] Aod & [S]O[4] = [4] & [2] = [0] Yes; solution 

[} = [5]©[5] © [51015] = [1] © [1] = [2] No 


So the equation has four solutions: [0], [1], [3], and [4]. 


Example 4 shows that solving equations in 7, may be quite different from solving 
equations in Z. A quadratic equation in Z has at most two solutions, whereas the 
quadratic equation x° @ [5]Ox = [0] has four solutions in Z,. 


E Exercises 


A. 1. Write out the addition and multiplication tables for 
(a) Z (b) 4 © Z (d) Zz 
In Exercises 2-8, solve the equation. 


2. 2 @x =[0]inZ 


Currioty 2012 Cmore> Loans. Al Bight Gnerent May oct bs copia exon. oc dugir and. in whale oc in pact. Dus 10 electronic dahi», mms thied pety coma may te appead firm the eBook emioc eChagtar(s). Edaurial review bas 
that noy appi cose does oct oxtedely affect the ovaal learning axperiama Cmgege Loaming manye the right (0 sacro eXdicianel canine! mt any tins if sihana rights cestricticms require it. 


2.3 The Structure of Z, (p Prime) and Z, 37 


x? = [1] in Z 

4 = [l] inZ, 

x? © [3] © x @® [2] = [0] in Z 
. x2 @ [8] Ox = [0] in Z 
SOrOxO[l]=Ming 
. x +37 = [2] in Zo 


. (a) Find an element [a] in Z, such that every nonzero element of Z} is a power 
of [a]. 


(b) Do part (a) in Z. (c) Can you do part (a) in Z6? 
10. Prove parts 3, 7, 8, and 9 of Theorem 2.7. 
11. Solve the following equations. 

(a) x®x@x = [0] in Z, 

(b) x® x @x@x = [0] in Z, 

C) xBxOx@x@x =[inZ 
12. Prove or disprove: If [a] © [4] = [0] in Z,, then [a] = [0] or [b] = [0]. 
13. Prove or disprove: If [a] © [5] = [a] © [c] and [a] # [0] in Z,, then [5] = [c]. 

B. 14. Solve the following equations. 
(a) x7+ x = [0] in Z, 
(b) x2 + x = [0] in Zs 


(c) If p is prime, prove that the only solutions of x? + x = [0] in Z, are [0] and 
[p — I]. 


15. Compute the following products. 
(a) (la ® [bD in Z 
(b) ([a] ® [b]? in Z, [Hint: Exercise 11(a) may be helpful.] 
(c) (la [b inZ,  [Hint: See Exercise 11(c).] 
(d) Based on the results of parts (a)—(c), what do you think ([a] @ [b])’ is 
equal to in Z,? 
16. (a) Find all [a] in Z, for which the equation [a] © x = [1] has a solution. Then 
do the same thing for 


(b) % (c) Z (d) 4 


eer nn ps w 


A The Structure of Z, (p Prime) and Z, 


We now present some facts about the structure of %, (particularly when z is prime) 
that will provide a model for our future. work. First, however, we make a change of 
notation. 
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New Notation 


We have been very careful to distinguish integers in Z and classes in Z, and have 
even used different symbols for the.operations in the two systems. By now, however, 
you should be reasonably comfortable with the fundamental ideas and familiar with 
arithmetic in Z,. So we shall adopt a new notation that is widely used in mathemat- 
ics, even though it has the flaw that the same symbol represents two totally different 
entities. 

Whenever the context makes clear that we are dealing with Z,, we shall abbrevi- 
ate the class notation “[a]” and write simply “a.” In 4, for instance, we might say 
6 = 0, which is certainly true for classes in Z even though it is nonsense if 6 and 
0 are ordinary integers. We shall use an ordinary plus sign for addition in Z, and 
either a small dot or juxtaposition for multiplication. For example, in Z, we may 
write things like 


4+1=0 or 3°4=2 or 4+4=3. 


On those few occasions where this usage might cause confusion, we will return to the 
brackets notation for classes. 


EXAMPLE 1 


In this new notation, the addition and multiplication tables for Z, are 


CAUTION: Exponents are ordinary integers—not elements of Z,„. In Z3, 
for instance, 24 = 2:2-2+2 = 1 and 2! = 2, so that 24 + 2! 
even though 4 = | inZ,. 


The Structure of Z, When p Is Prime 


Some of the Z, do not share all the nice properties of Z. For instance, the product 
of nonzero integers in Z is always nonzero, but in Z we have 2 - 3 = 0 even though 
2 + Oand 3 + 0. On the other hand, the multiplication table on page 34 shows that the 
product of nonzero elements in Z, is always nonzero. Indeed, Z; has a much stronger 
property than Z. When a + 0, the equation ax = 1 has a solution in Z if and only if 
a = +1. But the multiplication table for Z, shows that, for any a + 0, the equation 
ax = | has a solution in Z,; for example, 


x = 3 is a solution of 2x = 1 
x = 4isa solution of 4x = 1. 
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More generally, whenever n is prime, Z, has special properties: 


Theorem 2.8 


If p > 1 is an integer, then the following conditions are equivalent:* 


(1) p is prime. 
(2) For any a # 0 in Zp, the equation ax = 1 has a solution in £. 
(3) Whenever be = 0 in Z,, then b =Oorc =0, 


The proof of this theorem illustrates the two basic techniques for proving state- 
ments that involve Z,: 
G) Translate equations in Z, into equivalent congruence statements in Z. Then 
the properti:s of congruence and arithmetic in Z can be used. The brackets 
notation for elements of Z, may be necessary to avoid confusion. 
(ii) Use the arithmetic properties of Z, directly, without involving arithmetic in Z. 
In this case, the brackets notation in Z, isn’t needed. 


Proof of Theorem 2.8 » (1) = (2) We use the first technique. Suppose p is prime 
and [a] + [0] in Z. Then in Z, a # 0 (mod p) by Theorem 2.3. Hence, 
p * aby the definition of congruence. Now the gcd of a and p is a posi- 
tive divisor of p and thus must be either p or 1. Since (a, p) also divides 
aand p ¢ a, we must have (a, p) = 1. By Theorem 1.2, au + pu = 1 for 
some integers u and v. Hence, au — 1 = p(—v), so that au = 1 (mod p). 
Therefore [au] = [1] in Z by Theorem 2.3. Thus [a][u] = [au] = [1], so 
that x = [u] is a solution of [a]x = [1]. 

(2) = (3) We use the second technique. Suppose ab = 0 in Z. If 

a = 0, there is nothing to prove. If a + 0, then by (2) there exists ue Z, 
such that au = 1. Then 


0 = 4:0 = u(ab) = (ua)b = (au)b = 1 -b =b 


In every case, therefore, we have a = 0 or b = 0. 


(3) = (1) Back to the first technique. Suppose that b and c are any 
integers and that p | be. Then bc = 0 (mod p). So by Theorem 2.3, 


[blic] = [bc] = [0] in Z. 


Hence, by (3), we have [b] = [0] or [c] = [0]. Thus, b = 0 (mod p) or c = 0 
(mod p) by Theorem 2.3, which means that p | b or p | c by the definition 
of congruence. Therefore, p is prime by Theorem 1.5. E 


The Structure of Z, 


When n is not prime, the equation ax = 1 need not have a solution in Z}. For instance, 
the equation 2x = 1 has no solution in Z4, as you can easily verify. The next result tells 
us exactly when ax = 1 does have a solution in Z,. For clarity, we use brackets notation. 


*See page 508 in Appendix A for the meaning of “the following conditions are equivalent" and what 
must be done to prove such a statement. 
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Theorem 2.9 


Let a and n be integers with n > 1. Then 
The equation [a]x = [1] has a solution in Z, if and only if (a, n) = 1 in Z. 


Proof» Since this is an “if and only if” statement, the proof has two parts. 
First we assume that the equation has a solution and show that (a, n) = 1. 
If [w] is a solution of [a]x = [1], then 


[allw] = [1] 


[aw] = [1] [Multiplication in Z,) 
aw = 1 (mod n) in Z [Theorem 2.3] 
aw —1=knforsomeintegerk [Definition of congruence] 
aw + n(—k) = 1 [Rearrange terms] 


Denote (a, n) by d. Since dis a common divisor of a and n, there are inte- 
gers rand s such that dr = a and ds = n. So we have 


aw + n(—k) = 1 
drw + ds(—k) = 1 
d(rw — sk) =1. 
So d| 1. Since d is positive by definition, we must have d = 1, that is, (a,n) = 1. 
Now we assume that (a, n) = 1 andshow that [a]x = [l]has a solu- 
tion in Z,. Actually, we’ve already done this. In the proof of (1) = (2) 
of Theorem 2.8, the primeness of p is used only to show that (a, p} = 1. 
From there on, the proof is valid in any Z, when (a, n) = 1, and shows 
that [a]x = [1] has a solution inZ,. @ 


Units and Zero Divisors 


Some special terminology is often used when dealing with certain equations, An ele- 
ment a in Z, is called a unit if the equation ax = 1 has a solution. In other words, a is 
a unit if there is an element b in Z, such that ab = 1. In this case, we say that b is the 
inverse of a. Note that ab = 1 also says that b is a unit (with inverse a). 


EXAMPLE 2 


Both 2 and 8 are units in 2), because 2 - 8 = 1. 8 is the inverse of 2 and 2 is the 
inverse of 8. Similarly, 3 is a unit in Z4 because 3 - 3 = 1. So 3 is its own inverse. 


EXAMPLE 3 


Part (2) of Theorem 2.8 says that when p is prime, every nonzero element of Z, 
is a unit. 


Here is a restatement of Theorem 2.9 in the terminology of units. 
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Theorem 2.10 


Let a and n be integers with n > 1. Then 
[a] is a unit in Z, if and only if (a, n) = 1 inZ. 


A nonzero element a of Z, is called a zero divisor if the equation ax = 0 has a 
nonzero solution (that is, if there is a nonzero element c in Z, such that ac = 0). 


EXAMPLE 4 


Both 3 and 5 are zero divisors in 7, because 3 - 5 = 0. Similarly, 2 is a zero 
divisor in Z, because 2 2 = 0. 


EXAMPLE 5 


Part (3) of Theorem 2.8 says that when p is prime, there are no zero divisors in Z}. 


E Exercises 


A. 1. Find all the units in 


(a) Z, b) & © Z (d) Zio- 
2. Find all the zero divisors in 
(a) Z, b) & © Z (d) Zio 
3. Based on Exercises 1 and 2, make a conjecture about units and zero divisors 
in Z,. 
4. How many solutions does the equation 6x = 4 have in 
(a) Z,? (b) Z? (c) Z,? (d) Zio? 


5. If a is a unit and b is a zero divisor in Z,, show that ab is a zero divisor. 


6. If n is composite, prove that there is at least one zero divisor in Z,. (See 
Exercise 2.) 


7. Without using Theorem 2.8, prove that if p is prime and ab = 0 in Z, then 
a= 0 orb = 0. [Hint: Theorem 1.8.] 


8. (a) Give three examples of equations of the form ax = b in 2, that have no 
nonzero solutions. 


(b) For each of the equations in part (a), does the equation ax = 0 havea 
nonzero solution? 


B. 9. (a) If ais a unit in Z,, prove that a is not a zero divisor. 


(b) If a is a zero divisor in Z,, prove that a is not a unit. [Hint: Think 
contrapositive in part (a).] 
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10. Prove that every nonzero element of Z, is either a unit or a zero divisor, but 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
18. 


not both. [Hint: Exercise 9 provides the proof of “not both”.] 


Without using Exercises 13 and 14, prove: If a, b € Z, and a is a unit, then the 
equation ax = b has a unique solution in Z,. [Note: You must find a solution 
for the equation and show that this solution is the only one.] 


Let a, b, n be integers with n > 1 and let d = (a, n). If the equation [a]x = [b] 
has a solution in Z,, prove that d | b. [Hint: If x = [r] is a solution, then [ar] = 
[b] so that ar — b = kn for some integer k.] 


Let a, b, n be integers with n > 1. Let d = (a, n) and assume d | b. Prove that 
the equation [a]x = [b] has a solution in Z, as follows. 


(a) Explain why there are integers u, v, a), b}, n, such that au + nv = d, 
a= da, b = db, n = dn. 


(b) Show that each of 
[ub], [ub, + nı], [ub; + 2], [ub, + 371), . . . , lub; + (d — Dn] 
is a solution of [a]x = [b]. 


Let a, b, n be integers with n > 1. Let d = (a, n) and assume d | b. Prove that 
the equation [a]x = [b] has d distinct solutions in Z, as follows. 
(a) Show that the solutions listed in Exercise 13 (b) are all distinct. 

[Hint: [r] = [s] if and only if n | (r — s).] 


(b) If x = [r] is any solution of [a]x = [b], show that [r] = [ub, + kn] for some 
integer k with 0 < k s d — 1. [Hint: [ar] — [aub,] = [0] (Why?), so that 
n | (a(r — ub,)). Show that n; | (a(r — ub,)) and use Theorem 1.4 to show 
that nı | (r = ub;).] 


Use Exercise 13 to solve the following equations.s 
(a) 15x = 9 in Lig (b) 25x = 10in Xs. 


If a + 0 and b are elements of Z, and ax = b has no solutions in Z,, prove that 
a is a zero divisor. 


Prove that the product of two units in Z, is also a unit. 


The usual ordering of Z by < is transitive and behaves nicely with respect to 
addition. Show that there is no ordering of Z, such that 


(i) if a< band db <c, thena < c 
(ii) if a < b, thena + c < b + c for every cinZ,. 


[Hint: If there is such an ordering with 0 < 1, then adding 1 repeatedly to both 
sides shows that 0 < 1 < 2 <:--<n—1 by (ii). Thus 0 < 2-— 1 by (i). Add 1 
to each side and get a contradiction. Make a similar argument when 1 < 0.] 


APPLICATION: Public Key Cryptography (Chapter 13) may be covered 
at this point if desired. 
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CHAPTER 3 


Rings 


ALTERNATE ROUTE: If you want to cover groups before studying rings, 
you should read Chapters 7 and 8 now. 


We have seen that many rules of ordinary arithmetic hold not only in Z but also in 
the miniature arithmetics Z,. You know other mathematical systems, such as the 
real numbers, in which many of these same rules hold. Your high-school algebra 
courses dealt with the arithmetic of polynomials. 

The fact that similar rules of arithmetic hold in different systems suggests 
that it might be worthwhile to consider the common features of such systems. 
In the long run, this might save a lot of work: If we can prove atheorem about one 
system using only the properties that it has in common with a second system, 
then the theorem is also valid in the second system. By “abstracting” the com- 
mon core of essential features, we can develop a general theory that includes 
as special cases Z, Z, and the other familiar systems. Results proved for this 
general theory will apply simultaneously to all the systems covered by the theory. 
This process of abstraction will allow us to discover the real reasons a particular 
statement is true (or false, for that matter) without getting bogged down in non- 
essential details. In this way a deeper understanding of all the systems involved 
should result. 

So we now begin the development of abstract algebra. This chapter is just 
the first step and consists primarily of definitions, examples, and terminology. 
Systems that share a minimal number of fundamental properties with Z and Z, 
are called rings. Other names are applied to rings that may have additional prop- 
erties, as you will see in Section 3.1. The elementary facts about arithmetic and 
algebra in arbitrary rings are developed in Section 3.2. In Section 3.3 we consider 
rings that appear to be different from one another but actually are “essentially the 
same” except for the labels on their elements. 
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2) Definition and Examples of Rings 


We begin the process of abstracting the common features of familiar systems with this 
definition: 


Definition A ring is a nonempty set R equipped with two operations* (usually written 
as addition and multiplication) that satisfy the following axioms. For all a, 
DEER: 
1. faeRand beR, then a + beR. [Closure for addition] 
2. a+ (b+c)=(a+b)+c. [Associative addition] 
3.a+b=b+a. [Commutative addition] 
4, There is an element Op in R such [Additive identity 
that a + Op = a = 0g + a for every or zero element] 
ace. 
5. For each a ER, the equation 
a +x = Og has a solution in R.t 
6. lfaeRand DER, then abeR. (Closure for multiplication] 
7. a(bc) = (ab)c. [Associative multiplication) 
8. alb +c) = ab + ac and (Distributive laws] 
(a + de = ac + be, 
These axioms are the bare minimum needed for a system to resemble Z and Z,,. But 
Z and Z, have several additional properties that are worth special mention: 
Definition A commutative ring is a ring Rthat satisfies this axiom: 
9. ab = ba for alla, b ER. [Commutative multiplication] 
Definition A ring with identity is a ring A that contains an element 1g satisfying this 
axiom: 
10. aig =a = 1ga forall a ER. (Multiplicative identity] 


**Operation” and “closure” are defined in Appendix B. 


tThose who have already read Chapter 7 should note that Axioms 1-5 simply say that a ring is an 
abelian group under addition. 
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In the following examples, the verification of most of the axioms is left to the 
reader. 


EXAMPLE 1 
With the usual addition and multiplication, 
Z (the integers) and R (the real numbers) 


are commutative rings with identity. 


EXAMPLE 2 


The set Z,, with the usual addition and multiplication of classes, is a commuta- 
tive ring with identity by Theorem 2.7. 


EXAMPLE 3 


Let E be the set of even integers with the usual addition and multiplication. 
Since the sum or product of two even integers is also even, the closure 

axioms (1 and 6) hold. Since 0 is an even integer, E has an additive identity 
element (Axiom 4). If a is even, then the solution of a + x = 0 (namely — a) is 
also even, and so Axiom 5 holds. The remaining axioms (2, 3, 7, 8, and 9) 
hold for all integers and, therefore, are true whenever a, b, c are even. 
Consequently, E is a commutative ring. E does not have an identity, however, 
because no even integer e has the property that ae = a = ea for every even 
integer a. 


EXAMPLE 4 


The set of odd integers with the usual addition and multiplication is not a 
ring. Among other things, Axiom | fails: The sum of two odd integers is 
not odd. 


Although the definition of ring was constructed with Z and Z, as models, there 
are many rings that aren’t at all like these models. In these rings, the elements may not 
be numbers or classes of numbers, and their operations may have nothing to do with 
“ordinary” addition and multiplication. 
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EXAMPLE 5 


The set T = {r, s, t, z} equipped with the addition and multiplication defined 
by the following tables is a ring: 


You may take our word for it that associativity and distributivity hold 
(Axioms 2, 7, and 8). The remaining axioms can be easily verified from the 
operation tables above. In particular, they show that 7 is closed under both 
addition and multiplication (Axioms 1 and 6) and that addition is commuta- 
tive (Axiom 3). 

The element z is the additive identity—the element denoted Og in Axiom 4. It be- 
haves in the same way the number 0 does in Z (that’s why the notation 0. is used in the 
axiom), but z is not the integer 0—in fact, it’s not any kind of number. Nevertheless, 
we shall call z the “zero element” of the ring T. 

In order to verify Axiom 5, you must show that each of the equations 


r+x=2z Stx=2 t+x=2 z+x=2 


has a solution in T. This is easily seen to be the case from the addition table; for 
example, x = ris the solution of r + x = z because r + r = z. 

Finally, note that T is not a commutative ring; for instance, rs = r and 
sr = Z, so that rs # sr. 


EXAMPLE 6 


Let M(R) be the set of all 2 X 2 matrices over the real numbers, that is, M(R) 
consists of all arrays 


C a where a, b, c, d are real numbers. 


Two matrices are equal provided that the entries in corresponding positions are equal; 


that is, 
a b Pog : : 
( y= ( ) if and only if a=rb=s,c=t,d=u. 
c d t u 
For example, 


(3 Nan GPI 
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Addition of matnces is defined by 


ab +e b\ fata b+b' 
c d €e d) \cte dtd] 
For example, 


3 -A 4 Fy _feta see fT 5 
5 1 6 0) \5+6 1+0) Wu If 


Multiplication of matrices is defined by 
a a w X\  fawt by ax+ bz 
c d)\y z) \cw+dy cxtaz) 
For example, 


a acer aceryea) 
-(2 a 


Reversing the order of the factors in matrix multiplication may produce a different 
answer, as is the case here: 


Rg a ly ad 
"la x 


So this multiplication is not commutative. With a bit of work, you can verify that 
M(R)is a ring with identity. The zero element is the zero matrix 


0 = 
0 O07 
cena’ -a —b\. ; 
which is denoted 0 and ¥ = 2 i Nis a solution of 
ab 0 0 
( V kaia” o} 


Weclaimthat the multiplicative identity element (Axiom | 0)is the matrix = ( 0 a 


To prove this claim, we first multiply a typical matrix in M(R) on the right by F: 


i AG Nat dee eek ea j 
c dJ\0 1) \cr1l+d-0 c-0+d-1) \ec df 
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Since multiplication is not commutative here, we also need to check left multiplication 
by Zas well: 


1 O\/fa b\ fl+tatO0re 1:b6+0:d\_ [a b 
0 Whke d) \Orat+ lee Ob +1:dJ \e df 


This proves that Z satisfies Axiom 10.* Coasequently, Zis called the identity matrix. 
Note that the product of nonzero elements of M(R) may be the zero element; for 


example, 
Goha o Gera ade g 
EXAMPLE 7 


If R is a commutative ring with identity, then M(R) denotes the set of all 

2 X 2 matrices with entries in R. With addition and multiplication defined as 
in Example 6, M(R) is a noncommutative ring with identity, as you can read- 
ily verify. For instance, M(Z) is the ring of 2 X 2 matrices with integer entries, 
M(Q) the ring of 2 X 2 matrices with rational number entries, and M(Z,) the 
ring of 2 X 2 matrices with entries from 7,,. 


EXAMPLE 8 


Let T be the set of all functions from R to R, where R is the set of real 
numbers. As in calculus, f + g and fg are the functions defined by 


(f+ g(x) = fix) + gx) and —(fg)(x) = Sf). 


You can readily verify that T is a commutative ring with identity. The zero ele- 
ment is the function h given by A(x) = 0 for all x€ R. The identity element is the 
function e given by ex) = 1 for all x ER. Once again the product of nonzero 
elements of T may turn out to be the zero element; see Exercise 36. 


We have seen that some rings do not have the property that the product of two 
nonzero elements is always nonzero. But some of the rings that do have this property, 
such as Z, occur frequently enough to merit a title. 


Definition An integral domain is a commutative ring R with identity 1, + Op that 
satisfies this axiom: 


11. Whenever a, b& R and ab = Op, then a = Op or b = Op. 


*Checking a possible identity element under both right and left multiplication is essential. There 
are rings in which an element acts like an identity when you multiply on the right, but not when you 
multiply on the left. See Exercise 11. 
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The condition |p + Og is needed to exclude the zero ring (that is, the single-element 
ring {0g}) from the class of integral domains. Note that Axiom 11 is logically equiva- 
lent to its contrapositive.* 


Whenever a # Op and b # Op, then ab + Op. 


EXAMPLE 9 


The ring Z of integers is an integral domain. If p is prime, then Z, is an integral 
domain by Theorem 2.8. On the other hand, @, is not an integral domain because 
4-3 = 0, even though 4 + 0and3 #0. 


You should be familiar with the set Q of rational numbers, which consists of all 
fractions a/b with a, b €Z and b # 0. Equality of fractions, addition, and multiplica- 
tion are given by the usual rules: 


a ae : “ 
ae if and only if as = br 

a ,c_adt be a c_ ae 

a” a bd b d bd 


It is easy to verify that Q is an integral domain. But Q has an additional property that 
does not hold in Z: Every equation of the form ax = 1 (with a + 0) has a solution in 
Q. Therefore, Q is an example of the next definition. 
Definition A field is:a commutative ring R with identity 1, # Og that satisfies this 
axiom: 


12. For each a # 0g in R, the equation ax = 1, has a solution in R. 
Once again the condition lg # Opg is needed to exclude the zero ring. Note that 


Axiom 11 is not mentioned explicitly in the definition of a field. However, Axiom 11 
does hold in fields, as we shall see. in Theorem 3.8 below. 


EXAMPLE 10 


The set R of real numbers, with the usual addition and multiplication, is a field. 
If p is a prime, then Z, is a field by Theorem 2.8. 


EXAMPLE 11 


The set C of complex numbers consists of all numbers of the form a + bi, 
where a, b E€ R and Ë = —1. Equality in € is defined by 


at bi=r+ si if and only if a=randb=s. 


*See Appendix A for a discussion of contrapositives. 
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The set C is a field with addition and multiplication given by 


(a + b)+(c+ dad) =(a+c)+(b+d)i 
(a + bi)(c + di) = (ac — bd) + (ad + boji. 


The field R of real numbers is contained in C because R consists of all complex 
numbers of the form a + 0i. If a + bi # 0 in C, then the solution of the equation 
(a + bi)x = 1 is x = c + di, where 


c=afa+HXER and = —b/(a? + B)ER (verify!). 


EXAMPLE 12 


Let K be the set of all 2 X 2 matrices of the form 


ki 


where a and b are real numbers. We claim that Kis a field. For any two matrices in K, 
a b rs c = atic b+ 
-b a -d c} \-b-d ate 
a by fe _[{ ac—bd ad + be 
-b a -å ce} \-ad-—be ac— bd} 


In each case the matrix on the right is in K because the entries along the main 
diagonal (upper left to lower right) are the same and the entries on the opposite 


diagonal (upper right to lower left) are negatives of each other. Therefore, K is 
closed under addition and multiplication. K is commutative because 


Ae ae ee ee 


Clearly, the zero matrix and the identity matrix J are in K. If 


a b 
A= 
(5 <) 
is not the zero matrix, then verify that the solution of 4X = Jis 


; ee —bld 


ae! 2 
bld rai EK, where d =a’ + bt. 


Whenever the rings in the preceding examples are mentioned, you may assume 
that addition and multiplication are the operations defined above, unless there is some 
specific statement to the contrary. You should be aware, however, that a given set (such 
as Z) may be made into a ring in many different ways by defining different addition 
and multiplication operations on it. See Exercises 17 and 22-26 for examples. 
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Now that we know a variety of different kinds of rings, we can use them to produce 
new rings in the following way. 


EXAMPLE 13 


Let Tbe the Cartesian product Z6 X Z, as defined in Appendix B. Define 
addition in Tby the rule 


(a,2z) + (a’, z) = (a +a',z + 7’). 
The plus sign is being used in three ways here: In the first coordinate on the right-hand 
side of the equal sign, + denotes addition in Z4; in the second ordinate, + denotes 
addition in Z; the + on the left of the equal sign is the addition in T that is being defined. 
Since Z; is ating anda, a’ E Zg, the first coordinate on the right, a + a’, isin Z,. Similarly 
z+2' € Z. Therefore, addition in T is closed. Multiplication is defined similarly: 
(a, z)(a’, 2’) = (aa’, zz’). 


For example, (3, 5) + (4, 9) = (3+ 4, 5 + 9) = (1, 14) and (3, 5)(4, 9) = 

(3° 4,5 +9) = (0, 45). You can readily verify that T is a commutative ring with 
identity. The zero element is (0, 0), and the multiplicative identity is (1, 1). What 
was done here can be done for any two rings. 


Theorem 3.1 
Let R and S be rings. Define addition and multiplication on the Cartesian 
product R X S by 

(A5)+(F',8)=(r+r,s4+s') and (r,s)(r',8") = (rr, ss‘). 


Then R X S is a ring. If R and S are both commutative, then so is R X S. If both 
Rand S have an identity, then so does R X S. 


Proof» Exercise 33. m 


Subrings 


If R is a ring and S is a subset of R, then S may or may not itself be a ring under the 
operations in R. In the ring Z of integers, for example, the subset E of even integers is 
a ring, but the subset O of odd integers is not, as we saw in Examples 3 and 4. When 
a subset S of a ring R is itself a ring under the addition and multiplication in R, then 
we say that S is a subring of R. 


EXAMPLE 14 


Z isa subring of the ring Q of rational numbers and Q is a subring of the field 
R of all real numbers. Since Q is itself a field, we say that Q is a subfield of R. 
Similarly, R is a subfield of the field C of complex numbers. 
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EXAMPLE 15 


The matrix rings M(Z) and M(Q)in Example 7 are subrings of M(R). 


EXAMPLE 16 


The ring K in Example 12 is a subring of M(R). 


EXAMPLE 17 


Let T be the ring of all functions from R to R in Example 8. Then the subset S 
consisting of all continuous functions from R to R is a subring of T. To prove 
this, you need one fact proved in calculus: The sum and product of continuous 
functions are also continuous. So Sis closed under addition and multiplication 
(Axioms | and 6). You can readily verify the other axioms. 


Proving that a subset S of a ring R is actually a subring is easier than proving directly 
that Sis a ring. For instance, since a + b = b + a for all elements of R, this fact is also true 
when a, b happen to be in the subset S. Thus Axiom 3 (commutative addition) automati- 
cally holds in any subset S of a ring. In fact, to prove that a subset of a ring is actually a 
subring, you need only verify a few of the axioms for a ring, as the next theorem shows. 


Theorem 3.2 


Suppose that R is a ring and that S is a subset of R such that 


(i) S is closed under addition (if a, b ES, then a + be S); 

(ii) Sis closed under multiplication (if a, be S, then ab ES); 
(iii) OQ, ES; 
(iv) If aeS, then the solution of the equation a + x = Ogis in S. 


Then S is a subring of R. 


Note condition (iv) carefully. To verify it, you need not show that the equation 
a+ x = 0p has a solution—we already know that it does because R is a ring. You need 
only show that this solution is an element of S (which implies that Axiom 5 holds for S). 


Proof of Theorem 3.2 > As noted before the theorem, Axioms 2, 3, 7, and 8 hold 
for all elements of R, and so they necessarily hold for the elements of the 
subset S. Axioms 1, 6, 4, and 5 hold by (i)}+{iv). m 


EXAMPLE 18 


The subset S = {0,3} of Zs is closed under addition and multiplication 
(0 +0=0;0 + 3 = 3;3 + 3 = 0; similarly, 0-0 = 0 = 0-3; 3-3 = 3). By the 
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definition of S we have 0 € S. Finally, the equation 0 + x = 0 has solution 
x = 0E S, and the equation 3 + x = 0 has solution x = 3 € S. Therefore, Sis a 
subring of Z, by Theorem 3.2. 


EXAMPLE 19 


Let S be the subset of M(R) consisting of all matrices of the form (5 a 
Then S is closed under addition and multiplication because a 


a 0 r 0 atr 0+0 a+r 0 
s y+ JG an ee aes and 
a O\/r 0\_ ar 6 z 
B EINE t br+cs ct 
The identity matrix is in S (let a = 1, b = 0, c = 1) and the solution of 


E Ual Oe ee Bg 
b ak A Oe -b - i 
Hence S is a subring by Theorem 3.2. 


EXAMPLE 20 


The set ZV 24 = {a+ bV2 |a,b e Z}isa subring of R. You can easily verify 
that 


(a + BV2)(c + dV?) = ac + adV2 + bceV/2 + bd V2 + V2 
= (ac + 2bd) + (ad + bc) V2) e Z[V2]. 


So Z2[V2] is closed under multiplication. See Exercise 13 for the rest of the proof. 


E Exercises 


A. 1. The following subsets of Z (with ordinary addition and multiplication) satisfy 
all but one of the axioms for a ring. In each case, which axiom fails? 


(a) The set S of all odd integers and 0. 
(b) The set of nonnegative integers. 


2. Let R = {0, e, b, c} with addition and multiplication defined by the tables on 
page 54. Assume associativity and distributivity and show that R is a ring with 
identity. Is R commutative? Is R a field? 
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+] 0 ae br fe GO e¢ & 
0 | 0 e b cœ olio 0 0 0 
e |e 0 c b e 0 g- vb e 
bjb c 0 e bit b b 0 
Ele -b e 0 c 0 é O 


3. Let F = {0, e, a, b} with operations given by the following tables. Assume 
associativity and distributivity and show that F is a field. 


0 
0 
0 
0 
0 


on A lO 
a es Sœ ole 


0 
0 
e 
a 
b 


~a na oO 
ga FS oO RI] 
so Oe QIK 
eS 8 8&8 CO 
eS a œ oja 
so & 8 oja 


4. Find matrices 4 and Cin M(R) such that AC = 0, but CA + 0, where 0 is the 
zero matrix. [Hint: Example 6.] 


5. Which of the following six sets are subrings of M(R)? Which ones have an identity? 


(a) All matrices of the form l ‘) with reQ. 


~ 


(b) All matrices of the form k with a, b, cE Z. 
(c) All matrices of the form i A with a, b, ce R. 
(d) All matrices of the form k o) with aER. 


(e) All matrices of the form C 4 witha ER. 


© 
Q 


(f) All matrices of the form p a witha eR. 


6. (a) Show that the set R of all multiples of 3 is a subring of Z. 
(b) Let k be a fixed integer. Show that the set of all multiples of k is a subring of Z. 


7. Let K be the set of all integer multiples of V2, that is, all real numbers of the 
form nV/2 with neZ. Show that K satisfies Axioms 1-5, but is not a ring. 


8. Is the subset {1,—1, 4 —i} a subring of C? 

9. Let R be aring and consider the subset R* of R X R defined by R* = {(r, r)|r E R}. 
(a) If R = Ze list the elements of R*. 
(b) For any ring R, show that R* is a subring of R x R. 
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10. Is S= {(a, b) |a + b = 0} a subring of Z X Z? Justify your answer. 


11. Let S be the subset of M(R) consisting of all matrices of the form 5 o, 


(a) Prove that S is a ring. 


1 
(b) Show that J = ( ) is a right identity in S (meaning that AJ = A for 
every A in S). 0 


(c) Show that J is not a /eft identity in S by finding a matrix B in S such that 
JB + B. 


For more information about S, see Exercise 41. 
12. Let Z[é] denote the set {a + bi | a, bE Z}. Show that Z[ġ is a subring of C. 


13. Let ZV2] denote the set {a + 5V2 | a, bE Z}. Show that ZV2] is a subring 
of R. [See Example 20.] 


14. Let T be the ring in Example 8. Let S = {fe T | (2) = 0}. Prove that S is a 
subring of T. 


15. Write out the addition and multiplication tables for 
(a) Z, X Z, (b)Z,xZ, ()2Z,x2, 


16. Let A = is i) and 0 = f á in M(R). Let S be the set of all matrices B 
such that AB = 0. 
(a) List three matrices in S. [Many correct answers are possible.) 


(b) Prove that Sis a subring of M(R). [Hint: If B and C are in S, show that 
B + Cand BC are in S by computing A(B + C) and A(BC).] 


17. Define a new multiplication in Z by the rule: ab = 0 for all a, b, E Z. Show that 
with ordinary addition and this new multiplication, Z is a commutative ring. 


18. Define a new multiplication in Z by the rule: ab = 1 for all a, b, € Z. With 
ordinary addition and this new multiplication, is Z is a ring? 


19. Let S = {a, b, c} and let P(S) be the set of all subsets of S; denote the 
elements of P(S) as follows: 


S = {a,b,c}; D= {a,b}; E= {a,c}; F= {b,c}; 
A= {a}; B= {b}; C= {chs 0=Ø. 
Define addition and multiplication in P(S) by these rules: 
M+N=(M-N)U(N-M) and MN=MNN 
Write out the addition and multiplication tables for P(S). Also, see Exercise 44. 


B. 20. Show that the subset R = {0, 3, 6, 9, 12, 15} of Z,,is asubring. Does R have 
an identity? 


21. Show that the subset S = {0, 2, 4, 6, 8} of Z; is a subring. Does S have an 
identity? 


Cogyrighs 2012 Cangnge Leauge AR Right Caamved May oct ba copied, esmai. or dupficmatt in whofe or is parl Dus to elactronic dakia, arma third pasty contest May be mppmaad fren the eBoc® eaWor eChageer(s). Ednrial ewview haa 
Gnome that any enpgremed atant doas oct metadaNy «ffsct the overt lexning axperiame Cangege Learning manrven the right 0 mave afiidonal contest at any tine if mipun tights cestricrines require it. 


56 Chapter3 Rings 


22. 


23. 


24. 


26. 


27. 


28. 


29. 


30. 


Define a new addition @ and multiplication © on Z by 
a@b=at+b-1 and a©Qb=a+b- ab, 


where the operations on the right-hand side of the equal signs are ordinary 
addition, subtraction, and multiplication. Prove that, with the new operations 
@ and ©, Z is an integral domain. 


Let E be the set of even integers with ordinary addition. Define a new 
multiplication * on E by the rule “a + b = ab/2” (where the product on the 
right is ordinary multiplication). Prove that with these operations £ is a 
commutative ring with identity. 
Define a new addition and multiplication on Z by 

a@b=at+b-1 and aOb=ab-(at+b)+2. 


Prove that with these new operations Z is an integral domain. 


. Define a new addition and multiplicationon Q by 


r®s=rt+st+l and rOs=rtrts. 


Prove that with these new operations Q is a commutative ring with identity. Is 
it an integral domain? 


Let L be the set of positive real numbers. Define a new addition and 
multiplication on L by 


a@®b=ab and a@b=a™., 
(a) Is La ring under these operations? 
(b) Is L a commutative ring? 
(c) Is La field? 


Let S be the set of rational numbers that can be written with an odd 
denominator. Prove that S is a subring of Q but is not a field. 


Let p be a positive prime and let R be the set of all rational numbers that can 
be written in the form r/p’ with r, i €Z, and i = 0. Note that Z C R because 
each n EZ can be written as n/p’. Show that R is a subring of Q. 


The addition table and part of the multiplication table for a three-element ring 
are given below. Use the distributive laws to complete the multiplication table. 


p 9 
|e w £ hil im se æ 
s s t r sS r t 
t t r sS t r 
Do Exercise 29 for this four-element ring: 
RU Æ y Zz wo x P Z 
w w X y Zz w w w w w 
x|x y 2 w x|w y 
yy z w w y| w w 
Z| 2 w oe y z | w w y 
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31. A scalar matrix in M(R) is a matrix of the form a A for some real 
number k. 


(a) Prove that the set of scalar matrices is a subring of M(R). 
(b) If K is a scalar matrix, show that KA = AK for every A in M(R). 
(c) If K is a matrix in M(R) such that KA = AK for every A in M(R), show 


that K is a scalar matrix. [Hint: If K = + a let A = k “| Use.the 


fact that KA = AK to show that b = 0 andc = 0. Then make a similar 


argument with 4 = (r t to show that a = d.] 


0 


32. Let R be a ring and let Z(R) = {a E€ R | ar = ra forevery r € R}. In other 
words, Z(R) consists of all elements of R that commute with every other 
element of R. Prove that Z(R) is a subring of R. Z(R) is called the center of 
the ring R. [Exercise 31 shows that the center of M(R) is the subring of scalar 
matrices.] 


33. Prove Theorem 3.1. 


34. Show that M(Z,) (all 2 X 2 matrices with entries in Z,) is a 16-element 
noncommutative ring with identity. 


35. Prove or disprove: 
(a) If Rand S are integral domains, then R X Sis an integral domain. 


(b) If Rand S are fields, then R X Sis a field. 
36. Let T be the ring in Example 8 and let f, g be given by 


0 ifx <2 _f2-x ifx<2 
OEE ME ifx>2 w= {5 if x > 2. 


Show that f, gE T and that fg = 07. Therefore Tis not an integral domain. 
37. (a) If Risa ring, show that the ring M(R) of all 2 X 2 matrices with entries in 
Risaring. 
(b) If R has an identity, show that M(R) also has an identity. 
38. If Ris aring andaeER, let Ap = {rE R| ar = 0g}. Prove that Ap is a subring 


of R. Apis called the right annihilator of a. [For an example, see Exercise 16 in 
which the ring S is the right annihilator of the matrix 4.] 


39. Let Q(V2) = (r + sV2 |r, s EQ}. Show that Q(V2) is a subfield of R. 
[Hint: To show that the solution of (r + #\/2)x = 1 is actually in av2), 
multiply 1/(r + sV2) by (r — sV D/E — #2) ] 

40. Let dbe an integer that is not a perfect square. Show that avd) = 
ja + bd | a, be Q} is a subfield of C. [Hint: See Exercise 39.] 
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41. Let S be the ring in Exercise 11. 
(a) Verify that each of these matrices is a right identity in S: 


a, : 2 
(3 aeli 4) 


(b) Prove that the matrix C = is a right identity in S if and only if 
x+y=l. y 


nle nj= 
N|= Nile 


(c) If x + y = 1, show that 4 *) is not a left identity in S. 


42. A division ring is a (not necessarily commutative) ring R with identity 
lr # Og that satisfies Axioms 11 and 12 (pages 48 and 49). Thus a field is a 
commutative division ring. See Exercise 43 for a noncommutative example. 
Suppose R is a division ring and a, b are nonzero elements of R. 


(a) If bb = b, prove that b = 1p. [Hint: Let v be the solution of bx = lp and 
note that bu = b’v.] 


(b) If «is the solution of the equation ax = 1g, prove that « is also a solution 
of the equation xa = lp. (Remember that R may not be commutative.) 


[Hint: Use part (a) with b = ua] 
43. In the ring M(C), let 
9 (2) «ty 
0 -i -1 0 i 0 


at 


The product of a real number and a matrix is the matrix given by this rule: 


Aa lle oa) 


The set H of real quaternions consists of all matrices of the form 


P fi o i Oo 0 1 6 i 
al + B+ oj + dk = of D + (4 Deel geal K 
= eG ie de ala a 
0 a 0 -bi —-e 0 di 0 
" at bi a 
—e+ di a-—bif 
where æ, b, c, and dare real numbers 
(a) Prove that 
? = jf =k? =- ij = -ji = k 


ik = -kj =i ki = -ik = į. 


(b) Show that H is a noncommutative ring with identity. 
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(c) Show that H is a division ring (defined in Exercise 42). [Hint: If M = al + 
bi + cj + dk, then verify that the solution of the equation Mx = 1 is the 
matrix tal — thi — tej — tdk, where t = 1/(a? + b? + c? + d?).] 

(d) Show that the equation x? = —1 has infinitely many solutions in H. 
[Hint: Consider quaternions of the form 01 + bi + cj — dk, where 
B+e7?4+d27=1] 

44. Let S be a set and let P(S) be the set of all subsets of S. Define addition and 
multiplication in P(S) by the rules 


M+N=(M-N)U(N-M™M) and MN= MAN. 
(a) Prove that P(S) is a commutative ring with identity. [The verification of 
additive associativity and distributivity is a bit messy, but an informal 


discussion using Venn diagrams is adequate for appreciating this example. 
See Exercise 19 for a special case.] 


(b) Show that every element of P(S) satisfies the equations x? = x and 
x+x=0 Sy 
C. 45. Let C be the set R X R with the usual coordinatewise addition (as in 
Theorem 3.1) and a new multiplication given by 
(a, b)(c, d) = (ac — bd, ad + be) 
Show that with these operations C is a field. 


46. Let r and s be positive integers such that r divides ks + 1 for some k with 
1< kÑ r. Prove that the subset {0, r, 2r, 3r,...,(s — 1)r} of Z, is a ring with 
identity ks + 1 under the usual addition and multiplication in Z,,. Exercise 21 
is a special case of this result. 


APPLICATION: Applications of the Chinese Remainder Theorem 


(Section 14.2) may be covered at this point if desired. 


EE Basic Properties of Rings 


When you do arithmetic in Z, you often use far more than the axioms for an integral 
domain. For instance, subtraction appears regularly, as do cancelation and the various 
rules for multiplying negative numbers. We begin by showing that many of these same 
properties hold in every ring. 


Arithmetic in Rings 


Subtraction is not mentioned in the axioms for a ring, and we cannot just assume 
that such an operation exists in an arbitrary ring. If we want to define a subtraction 
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operation in a ring, we must do so in terms of addition, multiplication, and the ring 
axioms. The first step is 


Theorem 3.3 


For any element a in a ring R, the equation a + x = 0, has a unique solution. 


Proof » We know that a + x = Or has at least one solution u by Axiom 5. If vis 
also a solution, then a + u = 0g anda + v = Og, so that 


v=Ogtv=(@tutv=(uta)tvaH=ut(@tr=utdg=u. 
Therefore, u is the only solution. @ 


We can now define negatives and subtraction in any ring by copying what happens 
in familiar rings such as Z. Let R be a ring and aE R. By Theorem 3.3 the equa- 
tion a + x = Og has a uniquesolution. Using notation adapted from Z, we denote this 
unique solution by the symbol “—a.” Since addition is commutative, 


—a is the unique element of R such that 
a+ (—a) = Og = (—a) + a. 


In familiar rings, this definition coincides with the known concept of the negative of 
an element. More importantly, it provides a meaning for “negative” in any ring. 


EXAMPLE 1 


In the ring Z, the solution of the equation 2 + x = 0 is 4, and so in this ring 
—2 = 4. Similarly, —9 = 5 in 71,4 because 5 is the solution of 9 + x = 0. 


Subtraction in a ring is now defined by the rule 
b — a means b + (—a), 


In Z and other familiar rings, this is just ordinary subtraction. In other rings we have 
a new operation. 


EXAMPLE 2 


In Z we have 1 —2=1+(-2.=14+4=5. 
In junior high school you learned many computational and algebraic rules for deal- 
ing with negatives and subtraction. The next two theorems show that these rules are 


valid in any ring. Although these facts are not particularly interesting in themselves, it 
is essential to establish their validity so that we may do arithmetic in arbitrary rings. 


Theorem 3.4 


lfa+b=a+tcinaringh, thenb=c. 
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Proof» Adding — ato both sides of a + b =a + c and then using associativity 
and negatives show that 


-a+(a+b)=-a+(a+c) 
(-a+a)+b=(-atat+c 
Op +b =Ogt+e 

b=c E 


Theorem 3.5 


For any elements a and b of a ring R, 
(1) a+ Og = Op = Op: a. In particular, Op * Op = Op. 
(2) a(—b) =—ab and (-—a)b = —ab. 
(8) ~(-a) = a. 
(4) —(a + b) = (—a) + (—6). 
(5) - (@ - b) = -a +b. 
(6) (—aX-b) = ab. 
If R has an identity, then 
(7) (—1g)a = —a. 
Proof > (1) Since 0p + Op = Op, the distributive law shows that 
a: Op + a+ Op = a(r + Og) =a‘ Og =a" Ont Op 
Applying Theorem 3.4 to the first and last parts of this equation shows 
that a « Og = Og. The proof that Op « a = Og is similar. 
(2) By definition, —ab is the unique solution of the equation 
ab + x = Og, and so any other solution of this equation must be equal 


to —ab. But x = a(—b)is a solution because, by the distribution law 
and (1), 


ab + a(—b) = a[b + (—4)] = a[0g] = 


Therefore, a(—b) = —ab. The other part is proved similarly. 

(3) By definition, —(—a) is the unique solution of (—a) + x = Og. But 
ais a solution of this equation since (—a) + a = 0g. Hence, —(—a) = a 
by uniqueness. 

(4) By definition, —{a + b) is the unique solution of (a + b) + x = 
Og, but (—a) + (—b) is also a solution, because addition is commutative, 
so that 


(a + b) + [(—a) + (—b)] = a + (—a) + b + (—b) 
= Op + Og 0k 
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Therefore, —(a + b) = (—a) + (—b) by uniqueness. 
(5) By the definition of subtraction and (4) and (3), 


—(a — b) = —(a + (—b)) = (—a) +(—(—4)) = —a + b. 
(6) (—a)(—4) = —(a (—b)) [By the second equation in (2), with —b in 


place of b) 
= —(—ab) [By the first equation in (2)] 
= ab [By (3), with ab in place of a] 


(7) By (2), 
(~la = —(1pa) = -@ = —a. m 


When doing ordinary arithmetic, exponent notation is a definite convenience, as is 
its additive analogue (for instance, a + a + a = 3a). We now carry these concepts over 
to arbitrary rings. If R is a ring, a€ R, and n is a positive integer, then we define 


a" = aaa '' -a (z factors). 
It is easy to verify that for any a € R and positive integers m and n, 
ad’ = d™" and (EY = a™. 


If Rhas an identity and a + Op, then we define a’ to be the element 1p. In this case, the 
exponent rules are valid for all m, n = 0. 
If R is a ring, a € R, and nis a positive integer, then we define 


na=at+atat*+*+a. (#Summands) 
—na = (—a) + (—a) + (—a) +++ + (—a). (7 summands) 


Finally, we define Oa = Og. In familiar rings this is nothing new, but in other rings it 
gives a meaning to the “product” of an integer n and a ring element a. 


EXAMPLE 3 


Let R bea ring and a, bE R. Then 
(a + bY = (a + bla + b) = a(a + b) + b(a + b) 
= aa + ab + ba + bb = d + ab + ba + b’. 


Be careful here. If ab + ba, then you aardt combine the middle terms If R is a com- 
mutative ring, however, then ab = ba and we have the familiar pattern 


(a + b} = @ + ab + ba + b? = a? + ab + ab + b? = a? + 2ab +H b. 


For a calculation of (a + b)” in a commutative ring, with n > 2, see the Binomial 
Theorem in Appendix E. 


It’s worth noting that subtraction provides a faster method than Theorem 3.2 for 
showing that a subset of a ring is actually a subring. 


Corruiy 2012 Cmonee Loans. Al Bight Rana May oct bs copia xm. a dugir and. in whale oc in pact. Des to electronic dghts, mms third paty comes may te expprmmnd fixan the eBook amor eChagtar(s). Ediaurial review bas 
that noy appi oaa does oct exateially affect the ovadil laning aperia Cangeze leaming maarves the right o rare eSdicional coninat mt any tins if subanqumt sights cestricicms require it. 


3.2 Basic Properties of Rings 63 


Theorem 3.6 


Let S be a nonempty subset of a ring R such that 
(1) S is closed under subtraction (if a, be sS, then a — be S}; 
(2) S is closed under multiplication (if a, b € S, then ab e 85). 
Then S is a subring of R. 


Proof» We show that S satisfies conditions ()—(iv) of Theorem 3.2 and hence 
is asubring. The conditions will be proved in this order: (ii), (iii), (iv), 
and (i). 


(ii) Hypothesis (2) here is identical with condition (ii) of Theorem 3.2. 
Hence, S satisfies condition (ii). 

(iii) Since Sis nonempty, there is some element c with ce S. Applying (1) 
(with a = c and b = c), we see that c:— c = Opis in S. Therefore, S 
satisfies condition (iii) of Theorem 3.2. 


(iv) If ais any element of S, then by (1), 02 — a = —a is also in S. Since 
—a is the solution of a + x = Op, condition (iv) of Theorem 3.2 is 
satisfied. 

(i) if a, bE sS, then —b is in S by the proof of (iv). By (1), a — (—b) = 
a+ b is in S. So S satisfies condition (1) of Theorem 3.2. 


Therefore, S isa subring of R by Theorem 3.2. E 


Units and Zero Divisors 


Units and zero divisors in Z, were introduced in Section 2.3. We now carry these con- 
cepts over to arbitrary rings. 


Definition An element a in a ring R with identity is called a unit if there exists u ER 
such thatau = 1p = ua. In this case the element u is called the (multiplica- 
tive) inverse of a and is denoted a. 


EXAMPLE 4 


The only units in Z are 1 and —1. 


EXAMPLE 5 


By Theorem 2.10, the units in Z,, are 1, 2, 4, 7, 8, 11, 13, and 14. For instance, 
2-8=1,so2-' =8and8"! =2. 


anh ere tec pp hein Rar pear ed Gapticatert, in whois ar ia part Dus to etecranic dghrs, nme third pasty content may be mppmmai fren the eBook astir eCbaptar(a). Ediawiel eeview bas 
that moy capgressed coset dow not @atedalty affect tha veal imning experience Cengage Learning manrves ths right w recevve ef@donel comes mi any time if sihexpam rights cenrictions raqeire it. 


64 Chapter3 Rings 


EXAMPLE 6 


1 
Every nonzero element of the field R is a unit: If a + 0, then@ ° a 1. The same 
thing is true for every field F. By definition, F satisfies Axiom 12: If a # 0,, then 
the equation ax = lp has a solution in F. Hence, 


Every nonzero element of a field is a unit. 


EXAMPLE 7 
A matrix i f in M(R) such that ad — be + 0 is a unit because, as you can 
easily verify, 
d —b d —b 
a b\[ ad-be ad-—be}) fl A 3 ad—be ad- be (: na” 0 
¢ od -i a "Mo ip = me a c d) \oO 1 
ad— bce ad— be ad— be ad- be 
In particular, each of these matrices is a unit: 
3 2 Ae 3 _ fifa 0 
e -(; s} ot s) c=(" A! 
Units in a matrix ring are called invertible matrices. 
EXAMPLE 8 
Let F bea field and M(F) the ring of 2 X 2 matrices with entriesin F. If 
A= ia (JE MUP and ad — be £ Op, then ad — bc is a unit in F by Example 6. 
The computations in Example 7, with ———— Ez be replaced by (ad — bc)~!, show that 4 is 
Deter rod tile : d(ad — be)™!  —b(ad — bc)! 
an invertible matrix [unit in MEJ withinvere(_ o agr p AT a 
Definition An element a in a ring A is a zero divisor provided that 
(1) a # Op. 


(2) There exists a nonzero element c in R such that ac = Og or ca = Op. 


Note that in requirement (2), the element c is not unique: Many elements in the ring 
may satisfy the equation ax = 0g or the equation xa = Opg (Exercise 6). Furthermore, 


‘Cheval 2012 Capa Laren (AN Rinin Berai Siy ox be pints or dapticanad, in whale or ia pert Dus to efsctronic aight», mas third party contest may be 


mm the eBook saor eChagnar(s). Edimwial eview has 
thet cay cappresend coated does oot mammiafty afec the overt learning aapeiamea Congigp Learning Gares the right to saD ve atidkionel costat ef aay thne if ethaxqam rights 


comricticms require it. 


3.2 Basic Properties of Rings 65 


in a noncommutative ring, it is possible to have ac = Og and ca + Op (Exercise 4 in 
Section 3.1). 


EXAMPLE 9 


Both 2 and 3 are zero divisors in Z, because 2 « 3 = 0. Similarly, 4 and 9 are 
zero divisors in Z4, because 4 +9 = 0, 

For a zero divisor A in a matrix ring, it is possible to find a matrix C such that 
AC = Oand CA =0. 


EXAMPLE 10 


Let F be a field. A nonzero matrix S = in M(F) such that ad — be = Opis a 


zero divisor because, as you can easily verify, 


a b a =p = 0 F 0 F aid d =f a b = 0 F 0 F 
c d}\-c a 0p O; —c ajc d Or Oz 
In particular, each of these matrices is a zero divisor in the given ring: 


A= fs ‘) in M(R), B= is A) in M(Q), and C= j :) in M(Z.). 


EXAMPLE 11 


Every integral R domain satisfies Axiom 11: If ab = Op, then a = Ogor b = Og. 
In other words, the product of two nonzero elements cannot be 0. Therefore, 


An integral domain contains no zero divisors. 


Finally, we present some useful facts about integral domains and fields. 


Theorem 3.7 

Cancelation is valid in any integral domain R: tf a # Og and ab = ac in R, then 
b=c. 

Cancelation may fail in rings that are not integral domains. In Z2, for instance, 


2°4=2- 10, but 4 # 10. 


Proof of Theorem 3.7» If ab = bc, then ab — be = Og, so that a(b — c) = Op. Since 
a # Og, we must have b — c = Og (if not, then a is a zero divisor, contra- 
dicting Axiom 11). Therefore,b=c. E 
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Theorem 3.8 


Every field F is an integral domain. 


Proof > Since a field is a commutative ring with identity by definition, we need 
only show that F satisfies Axiom 11: If ab = 0p, then a = Op or b = Op. 
So suppose that ab = Op. If b = Op there is nothing to prove. If b # Op, 
then 4 is a unit (Example 6). Consequently, by the definition of unit and 
part (1) of Theorem 3.5, 


a =alp= abb™ = Opb-! = Oz 


So in every case, a = Oy or b = 0r, Hence, Axiom 11 holds and Fis an 
integral domain. W 


The converse of Theorem 3.8 is false in general (Z is an integral domain that is not 
a field), but true in the finite case. 


Theorem 3.9 


Every finite integral domain £ is a field. 


Proof» Since R is a commutative ring with identity, we need only show that for 
each a # Op, the equation ax = 1, has a solution. Let a), a, ..., a, be 
the distinct elements of R and suppose a, # Og. To show that a,x = 1p 
has a solution, consider the products @,a1, 4,42, 4:43, . . > , Q,4n. If a; # a), 
then we must have a,a, # a,a, (because a,a; = aa, would imply that a, = a, 
by cancelation). Therefore, a,a,, 4,4}, . . . , a,a, are n distinct elements of 
R. However, R has exactly n elements all together, and so these must be 
all the elements of R in some order. In particular, for some j, œa; = 1p. 
Therefore, the equation a,x = 1g has a solution and Risa field. E 


E Exercises 


A. 1. Let R bea ring anda, DER. 
(a) (a + bXa-b) =? (b) a+b =? 
(c) What are the answers in parts (a) and (b) if R is commutative? 
2. Find the inverse of matrices A, B, and C in Example 7. 
3. An element e of a ring R is said to be idempotent if e? = e. 
(a) Find four idempotent elements in the ring M(R). 
(b) Find all idempotents in Z,9. 
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4. For each matrix A find a matrix C such that AC = 0 or CA = 0: 


a=() 3) a= 5 Be rE e KA 
x ay -2 47 3 -612 
5. (a) Show that a ring has only one zero element. [Hint: If there were more 
than one, how many solutions would the equation 0g + x = 0g have?] 
(b) Show that a ring R with identity has only one identity element. 


(c) Can a unit in a ring R with identity have more than one inverse? Why? 


6. (a) Suppose A and C are nonzero matrices in M(R) such that AC = 0. If k 
is any real number, show that A(KC) = 0, where kC is the matrix C with 
every entry multiplied by k. Hence the equation AX = 0 has infinitely 
many solutions. 


(b) If A = G A find four solutions of the equation 4X = 0. 


7. Let R be a ring with identity and let S = {nlp} | nE Z}. Prove that S is a 
subring of R. [The definition of na with n E Z, a E R is on page 62. Also see 
Exercise 27.] 


8. Let R be a ring and b a fixed element of R. Let T = {rb | r € R}. Prove that T 
is a subring of R. 


9. Show that the set S of matrices of the form 6 ee with a and b real 
numbers is a subring of M(R). a 


10. Let R and S be rings and consider these subsets of R X S: 
R= {(r,0)|rER} and S = {(Og, s)| seS}. 
(a) If R = Z, and S = &s. What are the sets Rand S? 
(b) For any rings R and S, show that R is a subring of R X S. 
(c) For any rings R and S, show that S is a subring of R X S. 


11. Let R be a ring and m a fixed integer. Let S = {r € R | mr = 0g}. Prove that S 
is a subring of R. 


12. Let a and b be elements of a ring R. 


(a) Prove that the equation a + x = b has a unique solution in R. (You 
must prove that there is a solution and that this solution is the only 
one.) 


(b) If R isa ring with identity and a is a unit, prove that the equation ax = b 
has a unique solution in R. 


13. Let Sand T be subrings of a ring R. In (a) and (b), if the answer is “yes,” 
prove it. If the answer is “no,” give a counterexample. 


(a) Is SA Tasubring of R? 
(b) Is SU Ta subring of R? 
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14. 


15. 


16. 
17. 
18. 


19. 
20. 


21. 


22. 


Prove that the only idempotents in an integral domain R are Op and 1p. (See 
Exercise 3.) 


(a) If aand bare units in aring R with identity, prove that ab is a unit whose 
inverse is (ab)! = b'at. 


(b) Give an example to show that if aand b are units, then a~'b—! need not be 
the multiplicative inverse of ab. 


Prove or disprove: The set of units in a ring R with identity is a subring of R. 
If «is a unit in a ring R with identity, prove that u is not a zero divisor. 


Let a be a nonzero element of a ring R with identity. If the equation ax = lp 
has a solution x and the equation ya = 1p has a solution v, prove that u = v. 


Let R and S be rings with identity. What are the units in the ring R X S? 


Let R and S be nonzero rings (meaning that each of them contains at least 
one nonzero element). Show that R X S contains zero divisors. 


Let R be a ring and let a be a nonzero element of R that is not a zero divisor. 
Prove that cancelation holds for a; that is, prove that 


(a) If ab = ac in R, then b = c. 
(b) If ba = cain R, then b = c. 
(a) If ab is a zero divisor in a ring R, prove that a or b is a zero divisor. 


(b) If a or b is a zero divisor in a commutative ring R and ab # Og, prove that 
ab is a zero divisor. 


23. (a) Let R be a ring and a, be R. Let m and n be nonnegative integers and 


24. 


25. 


B. 26. 


27. 


prove that 
(i) (m + n)a = ma + na. 

(ii) m(a + b) = ma + mb. 

(iti) m(ab) = (ma)b = a(mb). 

(iv) (ma)(nb) = mn(ab). 
(b) Do part (a) when m and n are any integers. 
Let R be a ring and a, bE R. Let m and n be positive integers. 
(a) Show that a”a” = a™*" and (a”)" = a™. 
(b) Under what conditions is it true that (ab)" = a"b"? 
Let S be a subring of a ring R with identity. 
(a) If S has an identity, show by example that 1s may not be the same as lp. 
(b) If both Rand S are integral domains, prove that ls = 1p. 


Let S be a subring of a ring R. Prove that 0s = Op. [Hint: For a E S, consider 
the equation a + x = a.] 


Let R be aring with identity and ba fixed element of R and let S = {nb | nE 2}. 
Is S necessarily a subring of R? [Exercise 7 is the case when b = 1,.] 
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28. Assume that R = {0g, lg, a, b} is a ring and that a and b are units. Write out 
the multiplication table of R. 


29. Let R be a commutative ring with identity. Prove that R is an integral domain 
if and only if cancelation holds in R (that is, a # Og and ab = ac in R imply 
b=). 


30. Let R be a commutative ring with identity and bE R. Let T be the subring of all 
multiples of b (as in Exercise 8). If u is a unit in R and u E T, prove that T = R. 


31. A Boolean ring is a ring R with identity in which x = x for every x€ R. For 
examples, see Exercises 19 and 44 in Section 3.1. If R is a Boolean ring, prove that 


(a) a + a = 0g for every ae R, which means that a = —a. [Hint: Expand 
(a+ a°] 


(b) Ris commutative. [Hint: Expand (a + b)?] 


32. Let R be a ring without identity. Let T be the set R X Z. Define addition and 
multiplication in T by these rules: 


(r, m) + (s, n) = (r + s,m + n). 
(r, m)(s, n) = (rs + ms + nr, mn). 
(a) Prove that Tis a ring with identity. 


(b) Let R consist of all elements of the form (r, 0) in T. Prove that R is a 
subring of T. 


33. Let R be a ring with identity. If ab and a are units in R, prove that b is a unit. 


5 
34. Let F bea field and A = p ) a matrix in M(F). 


d, 


(a) Prove that A is invertible if and only if ad — be + 0p. [Hint: Examples 7, 
8, and 10 and Exercise 17.] 


(b) Prove that 4 is a zero divisor if and only if ad — be = Op 
b riety 5 ; 
35. Let d = i h be a matrix with integer entries. 


(a) If ad — be = +1, show that A is invertible in M(Z). [Hint: Example 7.] 


(b) If ad — be + 0, 1, or —1, show that A is neither a unit nor a zero divisor in 
M(@). [Hint: Show that A has an inverse in M(R) that is not in M(Z); see 
Exercise 5(c). For zero divisors, see Exercise 34(b) and Example 10.] 


36. Let R be a commutative ring with identity. Then the set M(R) of 2 X 2 
matrices with entries in R) is a ring with identity by Exercise 37 of Section 3.1. 


If A = G a E M(R) and ad — bc is a unit in R, show that A is invertible in 


d 
M(R). [Hint: Replace by (ad — bc)~' in Example 7] 


ad — be 
37. Let R be a ring with identity and a, bE R. Assume that a is not a zero divisor. 


Prove that ab = 1p, if and only if ba = 1p. [Hint: Note that both ab = lg and 
ba = lgimply aba = a (why?); use Exercise 21.] 
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38. Let R be a ring with identity and a, bE R. Assume that neither a nor 6 is a 
zero divisor. If ab is a unit, prove that a and b are units. [Hint: Exercise 21.] 


39. (a) If Risa finite commutative ring with identity and a E R, prove that ais 
either a zero divisor or a unit. [Hint: If ais not a zero divisor, adapt the 
proof of Theorem 3.8, using Exercise 21.] 


(b) Is part (a) true if R is infinite? Justify your answer. 


40. An element a of a ring is nilpotent if a” = Op for some positive integer n. 
Prove that R has no nonzero nilpotent elements if and only if Op is the unique 
solution of the equation x*-= Op. 


The following definition is needed for Exercises 41—43. Let R be a ring with identity. 
Uf there is a smallest positive integer n such that nlg = Op, then R is said te have 
characteristic n. If no such n exists, R is said to have characteristic Zero. 


41. (a) Show that Z has characteristic zero and Z, has characteristic n. 

(b) What is the characteristic of Z4 X Z6? 
42. Prove that a finite ring with identity has characteristic n for some nr > 0. 
43. Let R be a ring with identity of characteristic n > 0. 

(a) Prove that na = Op for everya E R. 

(b) If Ris an integral domain, prove that n is prime. 


C. 44. (a) Let a and b be nilpotent elements in a commutative ring R (see 
Exercise 40). Prove that a + b and ab are also nilpotent. [You will need the 
Binomial Theorem from Appendix E.] 


(b) Let N be the set of all nilpotent elements of R. Show that N is a subring 
of R. 


45. Let R be a ring such that x? = x for every x R. Prove that R is commutative. 


46. Let R be a nonzero finite commutative ring with no zero divisors. Prove that 
Risa field. 


Ez Isomorphisms and Homomorphisms 


If you were unfamiliar with roman numerals and came across a discussion of integer 
arithmetic written solely with roman numerals, it might take you some time to realize 
that this arithmetic was essentially the same as the familiar arithmetic in Z except for 
the labels on the elements. Here is a less trivial example. 


EXAMPLE 1 


Consider the subset S = {0, 2, 4, 6, 8} of Zo. With the addition and multiplica- 
tion of Zj9, S is actually a commutative ring, as can be seen from these tables:* 


*The reason the elements of S are listed in this order will become clear in a moment. 
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+ 0 6 2 8 4 0 6 2 8 4 
0 0 6 2 8 4 0 0 0 0 0 0 
6 6 2 8 4 0 6 0 6 2 8 4 
2 2 8 4 0 6 2 0 2 4 6 8 
8 8 4 0 6 2 8 0 8 6 4 2 
4 4 0 6 2 8 4 0 4 8 2 6 


A careful examination of the tables shows that S is a field with five elements and that 
the multiplicative identity of this field is the element 6. 

We claim that S is “essentially the same” as the field Z, except for the labels on the 
elements You can see this as follows. Write out addition and multiplication tables 


Z, by 0, 1, 2, 3, 4. Then relabel the entries in the Z, tables according to this scheme: 


Relabel 0 as 0, relabel 1 as 6, relabel 2 as 2, 
relabel 3 as 8, relabel 4 as 4. 


Look what happens to the addition and multiplication tables for Zs: 


0 8| 4 
wile I Z ile 
o Oe . BL a 
5 |p Z Z |a 
6 6| 4 0 
Y Z Z A l 
-A A o 6 
z Z 2 5 |Y 
8l 8 62 
7 Z A I Z 
4 4j -2 8 
A JA |B : PB 


By relabeling the elements of Zs, you obtain the addition and multiplication 
tables for S. Thus the operations in Z; and S work in exactly the same way—the 
only difference is the way the elements are labeled. As far as ring structure goes, 
S is just the ring Z, with new labels on the elements. In more technical terms, Zs 
and S are said to be isomorphic. 


In general, isomorphic rings are rings that have the same structure, in the sense that 
the addition and multiplication tables of one are the tables of the other with the ele- 
ments suitably relabeled, as in Example 1. Although this intuitive idea is adequate for 
small finite systems, we need a rigorous mathematical definition of isomorphism that 
agrees with this intuitive idea and is readily applicable to large rings as well. 

There are two aspects to the intuitive idea that rings R and S are isomorphic: 
telabeling the elements of R and comparing the resulting tables with those of S to 
verify that they arethe same. Relabeling means that every element of R is paired with 
a unique element of S (its new label). In other words, there is a function f:R — S that 


*The Z, tables (in congruence class notation) are shown in Example 2 of Section 2.2. 
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assigns to each r ER its new label f(r) € S. In the preceding example, we used the rela- 
beling function f: Z; — S, given by 


fH=0 fD=6 fB=2 fB)=8 f= 
Such a function must have these additional properties: 
(i) Distinct elements of R must get distinct new labels: 
If r + r’ in R, then f(r) # f(r’) in S. 


(ii) Every element of S must be the label of some element in R:* 
For each s E S, there is an rE R such that f(r) = s. 


Statements (i) and (ii) simply say that the function f must be both injective and surjec- 
tive, that is, f must be a bijection.‘ 

In order for a bijection (relabeling scheme) f to be an isomorphism, applying f to 
the addition and multiplication tables of R must produce the addition and multiplica- 
tion tables of S. So if a + b = c in the R-table, we must have f(a) + f(b) = f(c) in the 
S-table, as indicated in the diagram: 


kaf 6 s +| fe) 


Si ~ po 


— puemi 


However, since a + b = c, we must also have f(a + b) = f(c). Combining this with the 
fact that f(a) + f(b) = f(c), we see that 


fla + b) = f(a) + fb). 


This is the condition that f must satisfy in order for f to change the addition tables 
of R into those of S. The analogous condition on f for the multiplication tables is 
J (ab) = f(a) f(b). We now can state a formal definition of isomorphism: 


Definition A ring R is isomorphic to a ring S (in symbols, R = S) if there is a function 


f:R — S such that 
(i) fis injective; 
(ii) fis surjective; 
(iii) f(a + b) = f(a) + f(b) and f(ab) = f(a) f(b) for alla, DER. 


In this case the function f is called an isomorphism. 


*Otherwise, we couldn’t possibly get the complete tables of S from those of R. 
tinjective, surjective, and bijective functions are discussed in Appendix B. 
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CAUTION: In order to be an isomorphism, a function must satisfy aX 
three of the conditions in the definition. It is quite possible 
for a function to satisfy any two of these conditions but not 
the third; see Exercises 4, 25, and 32. 


EXAMPLE 2 


In Example 12 on page 50, we considered the field K of all 2 x 2 matrices of 


the form 
b a r 


where a and b are real numbers. We claim that K is isomorphic to the field 
C of complex numbers. To prove this, define a function f: K —> C by the 


rule 
a b A 
nE a =a + bi. 


To show that f is injective, suppose 


a b r s 
AG SSe 
Then by the definition of f, a + bi = r + si in C. By the rules of equality in C, 
we must havea = r and b = s. Hence, in K 


( a 2 e ( f y 

-b a -s rf 

so that f is injective. The function fis surjective because any complex number 
a + bi is the image under f of the matrix 


ep 
>p f 
in K. Finally, for any matrices A and B in K, we must show that f(A + B) = 
f(A) + f(B) and (4B) = f(A) f(B). We have 


AUS eli Deets mee) 


=(a+c)+ (b+ di 
= (a + bi) + (c + di) 


l SE SF 
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and 


Me ohn |e ea 
= (ac — bd) + (ad + bc)i 
= (a+ bi)(e + di) 


a5 aly a 


Therefore, fis an isomorphism. 


It is quite possible to relabel the elements of a single ring in such a way that the ring 
is isomorphic to itself. 


EXAMPLE 3 


Let fC — C be the complex conjugation map given by f(a + bi) = a—bi.* The 
function f satisfies 


fila t+ bi) + (e+ d)J=f[(at+e) + (b+ di] 
= (a + ¢) — (b+ di = (a — bi) + (c — di) 
=f(a+ bi) +f(e+ di) 
and 
fila + bi)(e + di)] = f[(ac — bd) + (ad + be)i] 
= (ac — bd) — (ad + be)i = (a — bi)(c — di) 
=f (a + bi)f (e+ di). 


You can readily verify that fis both injective and surjective (Exercise 17). 
Therefore f is an isomorphism. 


EXAMPLE 4 
If R is any ring and tg:R — R is the identity map given by tp(r) = r, then for 
anya,beER 
tr (a + b) =a +b = egla) + tb) and tp(ab) = ab = ua)uglb). 


Since tp is obviously bijective, it is an isomorphism. 


Our intuitive notion of isomorphism is symmetric: “R is isomorphic to S” means 
the same thing as “S is isomorphic to R”. The formal definition of isomorphism is not 


*The function f has a geometric interpretation in the complex plane, where a + bi is identified with 
the point (a, 5): It reflects the plane in the x-axis. 
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symmetric, however, since it requires a function from R onto S but no function from 
S onto R. This apparent asymmetry is easily remedied. If f:R — S is an isomorphism, 
then fis a bijective function of sets. Therefore, fhas an inverse function g:S — R such 
that g ° f = tp (the identity function on R) and f° g = t,* It is not hard to verify that 
the function g is actually an isomorphism (Exercise 29). Thus R = S implies that 
S = R, and symmetry is restored. 


Homomorphisms 


Many functions that are not injective or surjective satisfy condition (iii) of the definition 
of isomorphism. Such functions are given a special name. 


Definition Let R and S be rings. A function £R — S is said to be a homomorphism if 
f(a + b) = f(a) + f(b) and f(ab) = f(a}f(b) foralla,beR. 


Thus every isomorphism is a homomorphism, but as the following examples show, 
a homomorphism need not be an isomorphism because a homomorphism may fail to 
be injective or surjective. 


EXAMPLE 5 


For any rings R and S the zero map z:R > S given by 2(r) = 0. for every rE R is 
a homomorphism because for any a, bE R 


z(a + b) = Os = Os + 0g = z(a) + z(b) 


and 
z(ab) = Os = Og: 05 = 2(a)z(5). 


When both R and S contain nonzero elements, then the zero map is neither 
injective nor surjective. 


EXAMPLE 6 


The function f:Z — Zs given by f(a) = [a] isa homomorphism because of the 
way that addition and subtraction are defined in Z,: for any a, bE Z 


f(a + b) = [a + b] = [a] + [b] = fa) + f(b) 
and 


S(ab) = [ab] = [alib] = f(a)f(e). 
The homomorphism f is surjective, but not injective (Why?). 


*See Appendix B for details. 
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EXAMPLE 7 


The map g:R — M(R) given by 


saals 


is a homomorphism because for any r, sE R 


seat PS al orraa 


F ee s) r : ) are 


erat) =(2 TVS S\=({, 2) = etn. 


The homomorphism g is injective but not surjective (Exercise 26). 


CAUTION: Not all functions are homomorphisms. The properties 
flat b)=fla)+fO®) and flab) = fia)fib) 
fail for many functions. For example, if f:R — R given by 
f(x) = x + 2, then 
fB+H=fM=9 but 93) +f(4)=54+6=11 


so that f(3 + 4) # f(3) + /(4). Similarly, f(3 - 4) # f(3) K4) 
because 


f3°4)=f(12)= 14, bt =f) f(4) = 5-6 = 30. 


Theorem 3.10 


Let f:R — S be a homomorphism of rings. Then 


(1) fp) = Os 
(2) f(a) = —f(a) for every a ER. 
(3) f(a — b) = f(a) — f(b) for all a, DER. 


If Ris a ring with identity and f is surjective, then 


(4) S is a ring with identity f(1,). 
(5) Whenever u is a unit in R, then f(u) is a unit in S and f(u)! = f(u’). 
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Proof» (1) (Og) + (Og) = fOr + Op) [f is a homomorphism.) 
fOr) + f(Oz) = f(z) [Or + Og = Ogin R] 
SOR) + fOr) = fOr) + Os [f(Og) + 0s = f(Og) in S] 
fOr) = 0s [Subtract f(0,) from both sides). 


(2) First, note that 
f@ +f(-a) =f(a+(—a)) [f isahomomorphism.] 
= f(Op) [a + (—a) = 0p] 
=0, [Part (2)}. 
Therefore, f(—a) is a solution of the equation f(a) + x = Os. But the 
unique solution of this equation is —f(a) by Theorem 3.3. Hence 


S(—a) = —f(a) by uniqueness. 


(3) f(a — b) = f(a + (—d)) [Definition of subtraction] 
= f(a) + f(-d)) Lf is a homomorphism. | 
= f(a) + (-f) [Part (2)] 
= f(a) -f(b) [Definition of subtraction]. 


(4) We shall show that f(1})E S is the identity element of S. Let s 
be any element of S. Then since fis surjective, s = f(r) for some rE R. 
Hence, 


5° f(r) SAAD Sfr a) = fO = 
and, similarly, f(1,) + s = s. Therefore, S has f(1 p) as its identity element. 


(5) Since u is a unit in R, there is an element v in R such that 
uv = lp = vu. Hence, by (4) 


SFO) = fur) = f(x) = 1s 


Similarly, vu = 1p implies that f(v)f(u) = 15. Therefore, f(u) is a unit in 
S, with inverse f(v). In other words, f(u)~! = f(v). Since v = u7!, we see 


that f(u) = f) = fe"). m 
If f:R > Sis a function, then the image of fis this subset of S: 
Imf = {se S| s = f(r) for some re R} = (f(r) | re R}. 


If f is surjective, then fn f = S by the definition of surjective. In any case we have: 


Corollary 3.11 


If FR — S is a homomorphism of rings, then the image of f is a subring of S. 


Proof » Denote Im by I. Iis nonempty because 0; = f(0p) E I by (1) of Theorem 3.10. 
The definition of homomorphism shows that J is closed under multiplica- 
tion: If f(a), f(b) EF, then f(a) f(b) = f(ab) EL. Similarly, Z is closed under 
subtraction because f(a) — f(b) = f(a — b) € I by Theorem 3.10. Therefore, I 
is a subring of S by Theorem 3.6. E 
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Existence of lsomorphisms 


If you suspect that two rings are isomorphic, there are no hard and fast rules for 
finding a function that is an isomorphism between them. However the properties of 
homomorphisms in Theorem 3.10 can sometimes be helpful. 


EXAMPLE 8 


If there is an isomorphism f from Z,, to the ring Z, X Z,, then f(1) = (1, 1) by 
part (4) of Theorem 3.10. Since fis a homomorphism, it has to satisfy 

f(D =f + 1) = fC) +) = (1, 1) + (1, 1) = (2, 2) 

SG) = f(2 + D = f(2) + f1) = (2, 2) + (1, 1) = (0, 3) 

F(4) =f + 1) =) +f) = ©, 3) + (1, 1) = (1, 0). 


Continuing in this fashion shows that if f is an isomorphism, then it must be 
this bijective function: 


JM=(L0) f4=0,0 AD=(1,3) — f(10) = 1, 2) 
fQ@=(22) f6)=21) f8%=@2,0 fdl)= (2,3) 
£3) = (0,3) =f) =@,2) f(9=(, 1) FO) = (0, 0). 


All we have shown up to here is that this bijective function fis the only possible 
isomorphism. To show that this factually is an isomorphism, we must verify 
that it is a homomorphism. This can be done either by writing out the tables 
(tedious) or by observing that the rule of f can be described this way: 


flah) = (als, [al,); 


where [a], denotes the congruence class of the integer a in Z,9, [a]; denotes the 
class of a in Z;, and [a], the class of a in Z4. (Verify that this last statement is 
correct.) Then 


fidu + [4] 2) = fla + bl) [Definition of addition in Z,7]| 
= (fa + blh, [a + b]4) [Definition of f) 
= ({a]3 + [b];, [ala + [b]4) [Definition of addition in Z; and Z4] 
= (a]3, [a]4) + ((b]s, [b]4) [Definition of addition in Z3 X Za 
= fidi) + fbl) [Definition of f). 
An identical argument using multiplication in place of addition shows that 


S (lah bhi) = £(lahi2)f((b]12). Therefore, fis an isomorphism and Zy, = Z3 X Z4. 


Up to now we have concentrated on showing that various rings are isomorphic, 
but sometimes it is equally important to demonstrate that two rings are not isomorphic. 
To do this, you must show that there is no possible function from one to the other 
satisfying the three conditions of the definition. 
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EXAMPLE 9 


Z; is not isomorphic to Z,, or to Z because it is not possible to have a surjective func- 
tion from a six-element set to a larger set (or an injective one from a larger set to Z6). 


To show that two infinite rings or two finite rings with the same number of elements 
are not isomorphic, it is usually best to proceed indirectly. 


EXAMPLE 10 


The rings Z, andZ, X Z, are not isomorphic. To show this, suppose on the 
contrary that f:Z4 — Z, x Z, is an isomorphism. Then /(0) = (0, 0) and 
S() = (1, 1) by Theorem 3.10. Consequently, 


SQ =f + 1) =f) + fC) = (1, 1) + (1, 1) = (0, 0). 


Since f is injective and /(0) = /(2), we have a contradiction. Therefore, no 
isomorphism is possible. 


Suppose that f:R — S is an isomorphism and the elementsa, b,c, . .. of R have a par- 
ticular property. If the elements f(a), f(b), f(c), . . . of S have the same property, then we 
say that the property is preserved by isomorphism. According to parts (1), (4), and (5) of 
Theorem 3.10, for example, the property of being the zero element or the identity element 
or a unit is preserved by isomorphism. A property that is preserved by isomorphism can 
sometimes be used to prove that two rings are not isomorphic, as in the following examples. 


EXAMPLE 11 


In the ring Z, the elements 1, 3, 5, and 7 are units by Theorem 2.10. Since 
being a unit is preserved by isomorphism, any isomorphism from Z, to another 
ring with identity will map these four units to four units in the other ring. 
Consequently, Zg is not isomorphic to any ring with less than four units. In 
particular, Z, is not isomorphic to Z, X Z, because there are only two units in 
this latter ring, namely (1, 1) and (3, 1) as youcan readily verify. 


EXAMPLE 12 


None of @, R, or C is isomorphic to Z because every nonzero element in the 
fields @, R, and C is a unit, whereas Z has only two units (1 and —1). 


EXAMPLE 13 


Suppose R is a commutative ring and f: R > S is an isomorphism. Then for any 
a,b ER, we have ab = ba in R. Therefore, in S 


SS) = flab) = fiba) = flb)f(a). 
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Hence, S is also commutative because any two elements of S are of the form f(a), 

f(b) (since f is surjective). In other words, the property of being a commutative 
ring is preserved by isomorphism. Therefore, no commutative ring can be iso- 
morphic to a noncommutative ring. 


E Exercises 


A. 1. Let f:Z,-+Z, X Z, be the bijection given by 
0—0,0) 1301), 2 3(0,2) 3-(1,0), 
4> (0, 1), 5— (1, 2). 


Use the addition and multiplication tables of Z and Z, X Z; to show that f is 
an isomorphism. 


2. Use tables to show that Z, X Z} is isomorphic to the ring R of Exercise 2 in 
Section 3.1. 


3. Let R be a ring and let R* be the subring of R X R consisting of all elements 
of the form (a, a). Show that the function f:R — R* given by f(a) = (a, a) is an 
isomorphism. 


as in Example 1). Show that the following bijection from Z, to S is not an 
isomorphism: 
0—0 I1—+2 2—>4 3—6 4—8. 
5. Prove that the field R of real numbers is isomorphic to the ring of all 2 x 2 
matrices of the form Cj : ), with a ER. [Hint: Consider the function f given 


ra= (o 21 


aA) 

6. Let R and S be rings and let R be the subring of R X S consisting of all 
elements of the form (a, 05). Show that the function f:R — R given by 
f(a) = (a, 05) is an isomorphism. 


7. Prove that R is isomorphic to the ring S of all 2 X 2 matrices of the form 


(¢ o where aR. 
0 a 


8. Let a( v2) be as in Exercise 39 of Section 3.1. Prove that the function 
FA(V2) > Q(V2) given by f(a + 6V2) = a — bV2 is an isomorphism. 
9, If f:Z — Z is an isomorphism, prove that f is the identity map. [Hint: What 
are f(1), f(1 + 1),...2] 
10. If R is a ring with identity and f:R — S isa homomorphism from R to a 


ring S, prove that f(1,) is an idempotent in S. [Idempotents were defined in 
Exercise 3 of Section 3.2.] 
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11. State at least one reason why the given function is not a homomorphism. 
(a) FR >R and f(x) = Vx. 
(b) g:E— E, where E is the ring of even integers and f(x) = 3x. 
(c) ER >R and f(x) = 2”. 
(d) k:Q —> Q, where k(0) = 0 and (5) = bir a+0. 


12. Which of the following functions are homomorphisms? 
(a) f:Z > Z, defined by f(x) = —x. 
(b) f:Z, > Z,, defined by f(x) = —x. 
(c) z: — Q, defined by g(x) = z - T 

(d) #:R => M(R), defined by h(a) = pe! a, 

(e) f:Z12 > Za, defined by f([x]i2) = [x], where [u], denotes the class of the 
integer u in Z,,. 


13. Let R and S be rings. 


(a) Prove that f:R X S— R given by f(r, s)) = ris a surjective homomorphism. 
(b) Prove that g:R X S — S given by g((r, s)) = s is a surjective homomorphism. 


(c) If both R and S are nonzero rings, prove that the homomorphisms f and g 
are not injective. 


14. Let fZ — Z, be the homomorphism in Example 6. Let K = {ae€Z | f(a) = [0]}. 
Prove that K isa subring of Z. 


15. Let f:R — S be a homomorphism of rings. If r is a zero divisor in R, is f(r) a 
zero divisor in S? 


B.16. Let T, R, and F be the four-clement rings whose tables are given in Example 5 
of Section 3.1 and in Exercises 2 and 3 of Section 3.1. Show that no two of 
these rings are isomorphic. 


17. Show that the complex conjugation function f:C — C (whose rule is 
S@ + bi) = a— bi) is a bijection. 


18. Show that the isomorphism of Z, and S in Example 1 is given by the function 
whose rule is f([x]s) = [6x]io (notation as in Exercise 12(e)). Give a direct 
proof (without using tables) that this map is a homomorphism. 


19. Show that S = £0, 4, 8, 12, 16, 20, 24} is a subring of 7 ,. Then prove that the 
mapf:Z, > S given by f([x],) = [8x]: is an isomorphism. 


20. Let E be the ring of even integers with the + multiplication defined in 
Exercise 23 of Section 3.1. Show that the map f:E > Z given by f(x) = x/2 is 
an isomorphism. 


21. Let Z* denote the ring of integers with the @ and © operations defined in 
Exercise 22 of Section 3.1. Prove that Z is isomorphic to 2*. 
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22. Let Z denote the ring of integers with the ® and © operations defined in 
Exercise 24 of Section 3.1. Prove that Z is isomorphic to Z. 


23. Let C be the field of Exercise 45 of Section 3.1. Show that C is isomorphic to 
the field C of complex numbers. 


24. (a) Let R be the set R X R with the usual coordinatewise addition, as in 


Theorem 3.1. Define a new multiplication by the rule (a, b)(c, d) = 
(ac, be). Show that R is a ring. 


(b) Show that the ring of part (a) is isomorphic to the ring of all matrices in 


a 0 
M(R) of the form k a 


0 
25. Let L be the ring of all matrices in M(Z) of the form i a! Show that the 


3 ; 0 F i : 
function f:L — Z given by H ) = a is a surjective homomorphism but 
not an isomorphism. c 


26. Show that the homomorphism g in Example 7 is injective but not surjective. 


27. (a) If g:R — S and f:S > T are homomorphisms, show that f° g:R > Tis a 
homomorphism. 


(b) If fand g are isomorphisms, show that f° g is also an isomorphism. 


28. (a) Give an example of a homomorphism f:R — S such that R has an identity 
but S does not. Does this contradict part (4) of Theorem 3.10? 


(b) Give an example of a homomorphism f:R — S such that S has an identity 
but R does not. 

29. Let f:R — S be an isomorphism of rings and let g:S — R be the inverse 
function of f (as defined in Appendix B). Show that g is also an isomorphism. 
[Hint: To show g(a + b) = g(a) + g(b), consider the images of the left- and 
right-hand side under f and use the facts that fis a homomorphism and f° g is 
the identity map.) 

30. Let f:R — S bea homomorphism of rings and let K = {rE R| f(r) = 0s}. 
Prove that K isa subring of R. 


31. Let f:R > S bea homomorphism of rings and T a subring of S. 
Let P = {rE R| f(r) ET}. Prove that P is a subring of R. 


32. Assume n = 1 (mod m). Show that the function f:2,, > Zm given by 
Sdn) = Pxm is an injective homomorphism but not an isomorphism when 
n = 2 (notation as in Exercise 12(e)). 


33. (a) Let T be the ring of functions from R to R, as in Example 8 of Section 3.1. 
Let 6:7 — R be the function defined by 8( f) = f(5). Prove that 9 is a 
surjective homomorphism. Is 9 an isomorphism? 


(b) Is part (a) true if 5 is replaced by any constant cE R? 


34. If f:R — S is an isomomorphism of rings, which of the following properties 
are preserved by this isomorphism? Justify your answers. 


(a) aE Ris a zero divisor. 
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(b) a ER is idempotent.* 
(c) Ris an integral domain. 


35. Show that the first ring is not isomorphic to the second. 


(a) Eand Z M) RXR x R x Rand M(R) 
(c) £4 X Z and Ze (d) Q and R 
(e) Z X Z and Z (f) Z, X Z, and Zi, 


36. (a) If f:R — S is a homomorphism of rings, show that for any r € R and 
ned, f(nr) = nf(r). 
(b) Prove that isomorphic rings with identity have the same characteristic. 
[See Exercises 41—43 of Section 3.2.] 


(c) If f:R— S isa homomorphism of rings with identity, is it true that R and 
S have the same characteristic? 


37. (a) Assume that e is a nonzero idempotent in a ring R and that e is not a zero 
divisor.* Prove that e is the identity element of R. [Hint: ê = e (Why?). If 
a E R, multiply both sides of e? = e by a.] 


(b) Let S be a ring with identity and T a ring with no zero divisors. Assume 
that f:S — T is a nonzero homomorphism of rings (meaning that at least 


one element of S is not mapped to 07). Prove that /(15) is the identity 
element of T. [Hint: Show that f(15) satisfies the hypotheses of part (a).] 


38. Let F be a field and f:F + R a homomorphism of rings. 


(a) If there is a nonzero element ¢ of F such that f(c) = Og, prove that fis 
the zero homorphism (that is, f(x) = 0g for every x€ F). [Hint: c! exists 
(Why?). If xE F, consider f(xee").] 


(b) Prove that f is either injective or the zero homomorphism. [Hint: If fis not 
the zero homomorphism and f(a) = f(b), then f(a — b) = 0,.] 

39. Let R be a ring without identity. Let T be the ring with identity of Exercise 32 
in Section 3.2. Show that R is isomorphic to the subring R of T. Thus, if R is 
identified with R, then R is a subring of a ring with identity. 

C. 40. For each positive integer k, let KZ denote the ring of all integer multiples of k (see 
Exercise 6 of Section 3.1). Prove that if m + n, then mZ is not isomorphic to nZ. 


41. Let m, n E Z with (m, n) = 1 and let f: Z mu > Zm X Z, be the function given 
by f([aloun) = (ldm, [a],). (Notation as in Exercise 12(e). Example 8 is the case 
m=3,n=4,) 

(a) Show that the map f is well defined, that is, show that if [a], = [b],., in 
Za» then [a], = [b],, in Z,, and [a], = [5], in Z,. 

(b) Prove that fis an isomorphism. [Hint: Adapt the proof in Example 8: the 
difference is that proving f is a bijection takes more work here.] 


42. If (m,n) + 1, prove that Zm» is not isomorphic to Zm X Zp. 


“Idempotents are defined in Exercise 3 of Section 3.2. 
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CHAPTER 4 


Arithmetic in FLX] 


In Chapter 1 we examined grade-school arithmetic from an advanced standpoint 
and developed some important properties of the ring Z of integers. In this chapter 
we follow a parallel path, but the starting point here is high-school algebra—- 
specifically, polynomials with coefficients in the field R of real numbers, such as 


xX-x-5 -x + 7r+4, x? -1. 
Dealing with polynomials means dealing with the mysterious symbol "x", which 
is used in three different ways in high-school algebra. First, x often "stands for" a 
number, as in the equation 12x — 8 = 0, where x is the number —. Second, x some- 


times doesn’t seem to stand for any particular number but is treated as if it were a 
number in simplification exercises such as this one: 


+x xP +1) 
L+ eet 


Third, x is also used as the variable in the rules of functions such as f(x) = 3x + 5. 

Now that you know what rings and fields are, we shall consider polynomials 
with coefficients in any ring and attempt to clear up some of the mystery about 
the nature of x. In Sections 4.1-4.3, we shall see that when x is given a meaning 
similar to the second way it is used in high school, then the polynomials with coef- 
ficients in a field F form a ring (denoted F[x]) whose structure is remarkably similar 
to that of the ring Z of integers. In many cases the proofs for Z given in Chapter 1 
carry over almost verbatim to F[x]. 

In Sections 4.4-4.6 we consider tests to determine whether a polynomial is irre- 
ducible (the analogue of testing an integer for primality). Here the development is 
not an exact copy of what was done in the integers. The reason is that the polyno- 
mial ring F[x] has features that have no analogues in the ring of integers, namely, 
the concepts of the root of a polynomial and of a polynomial function (which cor- 
respond to the first and third uses of x in high school). 

85 
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41 Polynomial Arithmetic and the Division Algorithm 


The underlying idea here is to define “polynomial” in a way that is the obvious exten- 
sion of polynomials with real-number coefficients. Let R be any ring. A polynomial 
with coefficients in R is an expression of the form 

Gy + ax + ax? +++ +a, x", 


where # is a nonnegative integer and.a,€ R. 

This informal definition raises several questions: What is x? Is it an element of R? 
If not, what does it mean to multiply x by a ring element? In order to answer these 
questions, note that an expression of the form ay + ajx + ay? +--+ + a,x" makes 
sense, provided that the a, and x are all elementsof some largerring. An analogy might 
be helpful here. The number 77 is not in the ring Z of integers, but expressions such as 
3 — 4r + 1207? + m and 8 — 7? + 677° make sense in the real numbers. Furthermore, 
it is not difficult to verify that the set of all numbers of the form 

atam + am +- +a, withn=O0andaeZ 


is a subring of R that contains both Z and a (Exercise 2). 

For the present we shall think of polynomials with coefficients in a ring R in much 
the same way, as elements of a larger ring that contains both R and a special element 
x that is not in R. This is analogous to the situation in the preceding paragraph with 
R in place of Z and x in place of 7, except that here we don’t know anything about 
the element x or even if such a larger ring exists. The following theorem provides the 
answer, as well as a definition of “polynomial”. 


Theorem 4.1 


If R is a ring, then there exists a ring 7 containing an element x that is not in 
R and has these properties: 
(i) R is a subring of 7. 
(ii) xa = ax for every a ER. 
(iii) The set Afr] of all elements of 7 of the form 
& tax +a? +s: tap (where n = Oand ach) 
is a subring of 7 that contains R. 
(iv) The representation of elements of R[x] is unique: If n = mand 
y+ ax + ag? +++ tae = Dy + bix + bx? + tba, 
then a, = b, for i = 1,2,...,n and b; = Og for each j >n. 
(v) ao + aix + agx? + ++ + ap? = Op if and only if a, = Og for every i. 


Proof» See Appendix G. We shall assume Theorem 4.1 here W 


The elements of the ring R[x] in Theorem 4.1 (ili) are called polynomials with 
coefficients in R and the elements a, are called coefficients. The special element x is 
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sometimes called an indeterminate.* To avoid any misunderstandings in Theorem 4.1, 
please note the following facts. 


1. Property (ii) of Theorem 4.1 does not imply that the ring T is commutative, but 
only that the special element x commutes with each element of the subring R 
(whose elements may not necessarily commute with each other). 


2. Property (v) is the special case of property (iv) when each 5, = Og. 


3. The first expression in property (v) is not an equation to be solved for x. In this 
context, asking what value of x makes ag + a,x + ax? +--+ + a,x" = Op is as 
meaningless as asking what value of m makes 3 + 5a — 77? = 0 because x (like 
T) is a specific element of a ring, not a variable that can be assigned values.‘ 


EXAMPLE 1 


The rings Z[x], Q[x], and R[x] are the rings you are familiar with from high 
school. For instance, 3 + 5x — 7x’ is in all three of these rings, but 3 + 7.5x? is 
only in Q[x] and R[x] because the coefficient 7.5 is not an integer. Similarly, 
4.2 + 3x + V5x* is in R[x] but not in the other two rings since V5 is not a 
rational number. Terms with zero coefficents are usually omitted, as they were 
in the preceding sentence. 


EXAMPLE 2 


Let E be the ring of even integers. Then 4 — 6x + 4x’ € E[x]. However, the 
polynomial x is not in [x], because it cannot be written with even coefficients. 


Polynomial Arithmetic 
The rules for adding and multiplying polynomials follow directly from the fact that 
Rix] is a ring, 


EXAMPLE 3 


If f(x) = 1 + 5x — xX? + 423 + 2x4 and g(x) = 4 + 2x + 3x7 + x3 in Z,[x], then 
the commutative, associative, and distributive laws show that 
Sx) + g(x) = 1 + 5x — X + 423 + 4) + (4 + 2x + 3x? + 2 + O24) 
= (1+ 4) + (5 + 2)x + (-1 + 3)x? + (4 + 1)? + (2 + 0)x4 
=5 + 0x + 2x7 + 5x3 + 2x4 = 5 + 2? + 5x3 + 2x4, 


“Although in common use, the term “indeterminate” is misleading. As shown in Appendix G, there 
is nothing undetermined or ambiguous about x. It is a specific element of the larger ring 7 and is 
not an element of R. 


tVariables and equations will be dealt with in Section 4.4. 
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EXAMPLE 4 


The product of 1 — 7x + x?and 2 + 3x in Q[x] is found by using the distribu- 
tive law repeatedly: 
(1 — 7x + (2 + 3x) = 1(2 + 3x) — 7x(2 + 3x) + x°(2 + 3x) 
= 1(2) + 13x) — 7x(2) — 7x(3x) + (2) + 7° BX 
= 2+ 3x — 14x — 21x? + 2x? + 3x 
=2—- 11x — 19x? + 3x. 


The preceding examples are typical of the general case. You add polynomials by 
adding the corresponding coefficients, and you multiply polynomials by using the 
distributive laws and collecting like powers of x. Thus polynomial addition is given by 
the rule:* 


(Gy + ax + ay? + +++ + apx") + (bo + bix + gx? +--+ + b,x”) 
= (ag + bo) + (a, + By)x + (a, + by? + +++ + (a, + 5)" 


and polynomial multiplication is given by the rule: 
(a + ax + ax? + +++ + a,x) (by + bix + Box? + +++ + b,x) 
= abo + (aob; + aybg)x + (aob + aby + mbo)? + +++ + apb ™. 
For each k = 0, the coefficient of x* in the product is 


k 
agbs + ayby_y + abra +t + apab + apb + arbo = Bates 


where q = Ogif i > n and b, = Opif j > m. 

It follows readily from this description of multiplication in R[x] that if R is com- 
mutative, then so is R[x] (Exercise 7). Furthermore, if R has a multiplicative identity 
1g; then 1, is also the multiplicative identity of R[x] (Exercise 8). 


Definition Let f(x) = & + ax + ax? + +++» + a,x" be a polynomial in R[x] with ap # Op. 
Then a, is called the leading coefficient of f(x). The degree of f(x) is the 
integer n; it is denoted “deg f(x)". In other words, deg f(x} is the largest 
exponent of x that appears with a nonzero coefficient, and this coefficient 
is the leading coefficient. 


EXAMPLE 5 


The degree of 3 — x + 4x? — 7x3 € R[x] is 3, and its leading coefficient is ~7. 
Similarly, deg (3 + 5x) = 1 and deg (x!) = 12. The degree of 2 + x + 4x7 — 
0x? + Ox* is 2 (the largest exponent of x with a nonzero coefficient); its leading 
coefficient is 4. 


*We may assume that the same powers of x appear by inserting zero coefficients where necessary. 
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The ring R that we start with is a subring of the polynomial ring R[x]. The elements 
of R, considered as polynomials in R[x], are called constant polynomials. The polyno- 
mials of degree 0 in R[x] are precisely the nonzero constant polynomials. Note that 


the constant polynomial 0, does not have a degree 


(because no power of x appears with nonzero coefficient). 


Theorem 4,2 


If R is an integral domain and f(x), g(x) are nonzero polynomials in R[x], then 
deg[f(x)g(x)] = deg f(x) + deg g(x). 


Proof» Suppose f(x) = dy + ax + ay? + +++ + a,x" and g(x) = by + bix + 
bx? +++ +5,x" with a, + Ogand bm # Og, so that deg f(x) = n and 
deg g(x) = m. Then 


S(%)B(X) = aobo + (aob; + aibo)x + (Gaby + aibi + Agby)x? + +++ + a,b, 


The largest exponent of x that can possibly have a nonzero coefficient is 
n + m. But a,b, # Og because R is an integral domain anda, # Og and 
bm # Og. Therefore, f(x)g(x) is nonzero and deg[f(x)g(x)] = n + m= 
deg f(x) + deg g(x). E 


Corollary 4.3 


If R is an integral domain, then so is R[x]. 


Proof» since R is a commutative ring with identity, so is R[x] (Exercises 7 and 8). 
The proof of Theorem 4.2 shows that the product of nonzero polynomials 
in R[x] is nonzero. Therefore, R[x] is an integral domain. W 


The first five lines of the proof of Theorem 4.2 are valid in any ring and lead to 
this conclusion. 


Corollary 4.4 
Let R be a ring. If f(x), g(x), and f(x)g(x) are nonzero in A{x], then 
deg [f(x)9(x)] = deg f(x) + deg g(x). 
EXAMPLE 6 
In Zdx, let f(x) = 2x4 and g(x) = 5x. Then f(x)g(x) = (2x*)(5x) = 4x, 
so deg [f(x)g(x)] = deg f(x) + deg g(x). However, if g(x) = 1 + 3x’, then 
SOA) = 2x4(1 + 3x4) = 2x4 + 2+ 3x° = 2x4 + Ox = 2x4, 


which has degree 4. But deg f(x) + deg g(x) = 6. So deg [f(x)g(x)] < deg f(x) + 
deg g(x). 
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For information on the degree of the sum of polynomials, see Exercises 4 and 12. 


Corollary 4.5 
Let R be an integral domain and f(x) € R[x). Then 
f(x) is a unit in Rix] if and only if f(x) is a constant polynomial that is a unit in R. 


In particular, if F is a field, the units in F[x] are the nonzero constants in £. 


Remember that the proof of an “if and only if” statement requires two separate proofs. 


Proof of Corollary 45> First, assume that f(x) is a unit in R[x]. Then f(x)g(x) = 1r 
for some g(x) in R[x]. By Theorem 4.2, 


deg f(x) + deg g(x) = deg [/(x)g(x)] = deg 1p = 0. 


Since the degrees of polynomials are nonnegative, we must have 
deg f(x) = 0 and deg g(x) = 0. Therefore, f(x) and g(x) are constant poly- 
nomials, that is, constants in R. Since f(x)g(x) = lg, f(x) is a unit in R. 
Conversely, assume that f(x) is a constant polynomial that isa unitin R, 
say f(x) = b, with b a unit in R. Let h(x) = 67). Then Ah) = bb" = 1g. 
Therefore, f(x) is a unit in R[x]. 
The last statement of the corollary follows immediately since 
every nonzero element of a field is a unit in the field (see Example 6 in 
Section 3.2). @ 


EXAMPLE 7 


The only units in Z[x] are 1 and —1, since these are the only units in Z. The units in 
R[x] (or in Q[x] or in C[x]) are all nonzero constants, since R, Q, and C are fields. 


Corollary 4.5 may be false if R is not an integral domain (Exercise 11). 


EXAMPLE 8 


5x + 1 is a unit in Z,,[x] that is not a constant because (as you should verify) 
(5x + 1)(20x + 1) =1. 


The Division Algorithm in F[x] 
Our principal interest in the rest of this chapter will be polynomials with coefficients in 
a field F (such as Q or R or Z,). As noted in the chapter introduction, the domain F[x] 
has many of the same properties as the domain Z of integers, including the Division 
Algorithm (Theorem 1.1), which states that for any integers a and b with b positive, 
there exist unique integers g and r such that 


a=bqg+r and Osr<b. 
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For polynomials, the only changes are to require the divisor to be nonzero and to 
teplace the statement “0 < r < b” by a statement involving degrees. Here is the formal state- 
ment (with f(x) in place of a, g(x) in place of b, and q(x), r(x) in place of q, r respectively). 


Theorem 4.6 The Division Algorithm in F[x] 


Let F be a field and f(x), g(x) EF [x] with g(x) + 0, Then there exist unique 
polynomials g(x) and r(x) such that 


f(x) = g(x)g(x) + r(x) and either r(x) =O, or deg r(x) < deg g(x). 
Example 9 shows how polynomial division works and why the Division Algorithm 
is valid in one particular case. 


EXAMPLE 9 


We shall divide f(x) = 3x5 + 2x* + 2x? + 4x? + x — 2 by g(x) = 2x3 + 1. The 
italic column on the right keeps track of what happens at each step.* 


divisor g(x) 
| = + x+1 < quotient q(x) 
D + 1[3x5 + 2x4 + 2x8 + 4x7 + x—2 e dividend f(x) 
axS + 3 « (5) eco 
ait + 2x + 8+ 2 -r - (5) 209 
2x* +x = xg(x) 
3 +597 -2 -f(x)- Geko — xg(x) 
2x3 +1 —1g(x) 
remainder r(x) —>3¥ -3 <f@)- (3 x) — xg(x) — lg) = 


Fx) — a(x) (ze +x+ 1) = 
Jœ) — go)ax) 


The last line on the left side and the last three lines on the right side show that 


fœ) — g(x)g(x) = x) or equivalently, f(x) = g(x)q(x) + r(x). 
So the Division Algorithm holds for the polynomials f(x) and g(x). 


“Division Refasher:The first term of the quotient = is one by dividing the leading term of the 


dividend (32°) by the leading term of the divisor en; Bx5/2x3 = Sa, The product of this term and the 
divisor ( (Ze Jot) i is then subtracted from the dividend ania ind + 2e + ae +x-—2,as 


shown. The process is repeated, using this lastexpression as the dividendand ie same divisor, and 
continues until you reach a polynomial with degree smaller than the degree of the divisor. 
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Of course, an example is not a proof, even though you can readily convince your- 
self that the same procedure works with other divisors and dividends (Exercise 5). 
Consequently, skipping the proof until you are familiar with mathematical induc- 
tion, would be quite reasonable. That’s why the proof of Theorem 4.6 is marked 
optional. 


Proofof Theorem 4.6 The Division Algorithm (Optional) » 
We first prove the existence of the polynomials g(x) and r(x). 

Case 1: If f(x) = 07 or if deg f(x) < deg g(x), then the theorem is true 
with q(x) = Or and (x) = f(x) because f(x) = g(x)0p + f(x). 

Case 2: If f(x) + 0r and deg g(x) = deg f(x), then the proof of exis- 
tence is by induction on the degree of the dividend f(x).* If deg f(x) = 
then deg g(x) = 0 also. Hence, f(x) = a and g(x) = b for some nonzero 
a, bE F. Since Fis a field, b is a unit and a = b(b~!a) + Op. Thus the 
theorem is true with q(x) = b-'a and r(x) = 

Assume inductively that the theorem is true whenever the dividend 
has degree less than x. This part of the proof is presented in two columns. 
The left-hand column is the formal proof, while the right-hand column 
refers to Example 9. The example will help you understand what’s being 


done in the proof. 
PROOF EXAMPLE 9 
We must show that the theorem is true whenever 
the dividend f(x) has degree n, say n=5 
pas 4 3 = 
imu bdo beth f(x) = 3 + ZÂ 2? tot ee 
with a, + Op. The divisor g(x) must have the ia 
form m=3 
glx) = bax" +--+ + bix + by a(x) = 2x7 +1 
=i 
with 5,, # Opand m = n. We begin as we would ba” 
in the long division of g(x) into f(x). Since F is a 
field and bn # Op, bm is a unit. Multiply the divi- et E E eer 
sor g(x) by a,b, 'x"-™ to obtain abm XM = 3 + 2-1 = * 
firs of 
t term o, 
= = 
Dm x" B(x) A the quotient 
= gb Ebat) Fel) =F + 1) 
3 
= A" + Agden bm! H -+ + AD By ™ = 3x5 + ral 


*We use the Principle of Complete Induction; see Appendix C. 
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Since abm 1x*"g(x) and f(x) have the same 
degree and the same leading coefficient, the 


difference 3 

JO- abn 22) K~) — egla) 
is a polynomial of degree /ess thann (or possibly = f(x) - (3 + a) 
the zero polynomial). Now apply the induction 2 
hypothesis with g(x) as divisor and the poly- = 2x4 + 23 ee +x—-2 
nomial f(x) — abm "x" g(x) as dividend (or 2 
use Case 1 if this dividend is zero). By induction fourth line of long division 


there exist polynomials q,(x) and r(x) such that 
SO- EAO and g(x) = x41 r(x) = 238-3 
r(x) = 0p or deg r(x) < deg g(x). Gane PETE Pa 
the quotient 
Therefore, 
S) = BC) apb yD + G4(x)] rA and 
x)= 0, or deg r(x) < deg g(x). 


Thus the theorem is true with g(x) = a,b, 1x°-"+ q,(x) when deg f(x) = n. This completes 
the induction and shows that q(x) and r(x) always exist for any divisor and dividend. 
To prove that q(x) and r(x) are unique, suppose that q,(x) and r(x) are polynomials 


such that 
F(x) = BX)q(x) + r(x) and — r(x) = Op or deg r(x) < deg g(x). 
Then 
B(x)q(x) + r(x) = f(x) = gax) + r(x), 
so that 


BOG) — H) = rx) — rx). 
If q(x) — q(x) is nonzero, then by Theorem 4.2 the degree of the left side is deg g(x) + 
deg{g(x) — 9{x)], a number greater than or equal to deg g(x). However, both r(x) and r(x) 
have degree strictly less than deg g(x), and so the right-hand side of the equation must also 
have degree strictly less than deg g(x) (Exercise 12). This is a contradiction. Therefore 
Ax) — q(x) = Op, or equivalently, g(x) = q(x). Since the left side is zero, we must have 
r(x) — r(x) = 0p so that r(x) = r(x). Thus the polynomials q(x) and r(x) are unique @ 


E Exercises 


NOTE: R denotes a ring and F a field. 

A. 1. Perform the indicated operation and simplify your answer: 
(a) (3x4 + 2x3 — 4x7 + x + 4) + (428 + x? + 4x + 3) in Ze] 
(b) (x + DF in Zi] 
© & — Din Zd] 
(d) (x? — 3x + 2)(2x3 — 4x + 1) in Z,[x] 
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2. Show that the set of all real numbers of the form 
ag tam +am? +: + am", withn=Oanda, EZ 
is a subring of R that contains both Z and 7. 
3. (a) List all polynomials of degree 3 in Z,[x]. 
(b) List all polynomials of degree less than 3 in Z,[x]. 


4. In each part, give an example of polynomials f(x), g(x) € Q[x] that satisfy the 
given condition: 


(a) The deg of f(x) + g(x) is less than the maximum of deg f(x) and deg g(x). 
(b) Deg [f(x) + a(x)] = max {deg f(x), deg a(x)}. 


5. Find polynomials g(x) and r(x) such that f(x) = g(x)g(x) + r(x), and r(x) = 
or deg r(x) < deg g(x): 


(a) f(x) = 3x* — 2x3 + 6? — x + Zand g(x) = x + x + 1 in Of. 
(b) f(x) = x* — 7x + 1 and g(x) = 2x7 + 1 in Qf. 
(c) fx) = 2x4 + 2 — x + 1 and g(x) = 2x — 1 in Zs[x]. 
(d) f(x) = 4x4 + 2x3 + 6x7 + 4x + 5 and g(x) = 3x7 + 2in Z,[x]. 
6. Which of the following subsets of R[x] are subrings of R[x]? Justify your answer: 
(a) All polynomials with constant term Op. 
(b) All polynomials of degree 2. 
(c) All polynomials of degree = k, where k is a fixed positive integer. 
(d) All polynomials in which the odd powers of x have zero coefficients. 
(e) All polynomials in which the even powers of x have zero coefficients. 
7. If Ris commutative, show that R[x] is also commutative. 
8. If R has multiplicative identity 1g, show that 1, is also the multiplicative 
identity of R[x]. 
9. If c E Risa zero divisor in a commutative ring R, then is ¢ also a zero divisor 
in R[x]? 
10. If F is a field, show that F[x] is not a field. [Hint: Is x a unit in F[x]?] 
B.11. Show that | + 3x is a unit in %,[x]. Hence, Corollary 4.5 may be false if Ris 
not an integral domain. 
12. If f(x), g(x) € R[x] and f(x) + g(x) + Op, show that 


deg[ f(x) + g(x] = max {deg f(x), deg g(x)}. 


13. Let R be a commutative ring. If a, # Op and f(x) = a + ax + ape + ++ + 
a,x" (with a, + Og) is a zero divisor in R[x], prove that a, is a zero divisor in R. 


14. (a) Let R be an integral domain and f(x), g(x) € R[x]. Assume that the 
leading coefficient of g(x) is a unit in R. Verify that the Division Algorithm 
holds for f(x) as dividend and g(x) as divisor. [Hint: Adapt the proof of 
Theorem 4.6. Where is the hypothesis that F is a field used there?] 
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(b) Give an example in Z[x] to show that part (a) may be false if the leading 
coefficient of g(x) is not a unit. [Hint: Exercise 5(b) with Z in place of Q.] 


15. Let R be a commutative ring with identity anda E R. 
(a) If a’ = Og, show that 1p + ax is a unit in R[x]. [Hint: Consider 1 — ax + 
a?x?.] 
(b) If at = Og, show that lp + ax is a unit in R[x]. 


16. Let R be a commutative ring with identity anda E€ R. If 1p + ax is a unit in 
R[x], show that a" = Og for some integer n > 0. [Hint: Suppose that the inverse 
of lg + axis by + hix + bax? +--+ + bpč. Since their product is 1p, bo = 1p 
(Why?) and the other coefficients are all 0p.) 


17. Let R be an integral domain. Assume that the Division Algorithm always 
holds in R[x]. Prove that R is a field. 


18. Let ¢:R[x] — R be the function that maps each polynomial in R[x] onto its 
constant term (an element of R). Show that @ is a surjective homomorphism 
of rings. 


19. Let @:Z[x] — Z,[x] be the function that maps the polynomial ay + ajx + +++ + 
a,x in Z[x] onto the polynomial [ao] + [a;]x + - + - + [a;)x“, where [a] denotes 
the class of the integer a in Z,. Show that ¢ is a surjective homomorphism of 
rings. 


20. Let D:R[x] > R[x] be the derivative map defined by 
Dla + ax + ay? + +++ + ap”) =a, + 2agx + Bax? +++ + nae! 
Is D a homomorphism of rings? An isomorphism? 


C.21. Let k:R-> S be a homomorphism of rings and define a function A:R[x] -> Sx] 
by the rule 


hla + ax +++» + aX) = h(a) + h(ay)x + h(a)? +--+ + h(a)". 
Prove that 
(a) h is a homomorphism of rings. 
(b) A is injective if and only if 4 is injective. 
(c) h is surjective if and only if h is surjective. 
(d) If R = S, then R[x] = Six]. 


22. Let R be a commutative ring and let k(x) be a fixed polynomial in R[x]. Prove 
that there exists a unique homomorphism ¢: R[x] —> R[x] such that 


o(r)=rforallre R and (x) = k(x). 


42 Divisibility in F[x] 


All the results of Section 1.2 on divisibility and greatest common divisors in Z now 
carry over, with only minor modifications, to the ring of polynomials over a field. 
Throughout this section, F always denotes a field. 
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Definition Let F be a field and a(x), b(x) € F[x] with b(x) nonzero. We say that b(x) 
divides a(x) [or that b(x) is a factor of a(x)J, and write b(x) |.a(x) if a(x) = 
b(x)h(x) for some h(x) e fx). 


EXAMPLE 1 


(2x + 1) |(6x? — x — 2) in Q[x] because 6x? — x — 2 = (2x + 1)(3x — 2). 
Furthermore, every constant multiple of 2x + 1 also divides 6x? — x — 2. For 
instance, 5(2x + 1) = 10x + 5 divides6x? — x — 2 because 6x7 — x — 2 = 


5(2x + f5 — 2| 


Example 1 illustrates the first part of the following result. 


Theorem 4.7 
Let F be a field and a(x), b(x) € Fix] with b(x) nonzero. 


(1) If b(x) divides a(x), then cb(x) divides a(x) for each nonzero c e€ F. 
(2) Every divisor of a(x) has degree less than or equal to deg a(x). 


Proof » (1) If (x) | a(x), then a(x) = A(x)i(x) for some A(x) € Fix]. Hence, 


Ax) = 1p B(x)h(x) = cc 'b(x)h(x) = cb(x)fe h(x). 
Therefore, cb(x) | a(x). 


(2) Suppose b(x) | a(x), say a(x) = b(x)h(x). By Theorem 4.2, 
deg a(x) = deg b(x) + deg h(x). 


Since degrees are nonnegative, we must have 0 < deg b(x) = dega(x). m 


As we learned earlier, the greatest common divisor of two integers is the largest 
integer that divides both of them. By analogy, the greatest common divisor of two 
polynomials a(x), b(x) € F[x] ought to be the polynomial of highest degree that divides 
both of them. But such a greatest common divisor would not be unique because each 
constant multiple of it would have the same degree and would also divide both a(x) 
and 5(x). In order to guarantee a unique gcd, we modify this definition slightly by 
introducing a new concept. A polynomial in F[x] is said to be monic if its leading 
coefficient is 1, For instance, x? + x + 2 is monicin Q[x], but 2x + 1 is not. 


Definition Let F be a field and a(x), (x) © FAx], not both zero. The greatest common 
divisor (gcd) of a(x) and b(x) is the monic polynomial of highest degree 
that divides both a(x) and b(x). 
In other words, d(x) is the ged of a(x) and b(x) provided that d(x) is monic and 
(1) d(x) |ax) and d(x) |b(x); 
(2) If c(x) |a(x) and c(x) |b(x), then deg e(x} = deg d(x). 
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Polynomials a(x) and B(x) have at least one monic common divisor (namely 1p). Since 
the degree of a common divisor of a(x) and h(x) cannot exceed either deg a(x) or deg (x) 
by Theorem 4.7, there must be at least one monic common divisor of highest degree. In 
Theorem 4.8 below we shall show that there is only one monic common divisor of highest 
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degree, thus justifying the definition’s reference to the greatest common divisor. 
EXAMPLE 2 
To find the ged of 3x7 + x + 6 and 0 in Qf[x], we note that the common divisors 
of highest degree are just the divisors of 3x” + x + 6 of degree 2. These include 


3x? + x + 6 itself and all nonzero constant multiples of this polynomial—in 
particular, the monic polynomial 


FOE + xt =x + txt 2. 


Hence, x + Sx + 2isa god of 3° + x + 6 and 0. 
EXAMPLE 3 


You can easily verify these factonizations in Q[x]: 
a(x) = 2x4 + 5x3 — 5x — 2 = (2x + I(x + 2X(x + Ix — 1), 
B(x) = 2x3 — 3x? — 2x = (2x + 1)(x — 2)x. 
It appears that 2x + 1 isa common divisor of highest degree of a(x) and b(x). 
In this case, the constant multiple lox +1l)=x+ 73s a monic common divi- 


sor of highest degree. For a proof that x + 2 actually is the greatest common 
divisor, see Exercise 5(g). 


The remainder of this section, which is referred to only a few times in the rest of 
the book, may be skimmed if time is short—read the theorems and corollaries, but 
skip the proofs, 


Theorem 4,8 


Let F be a field and a(x), b(x) e [x], not both zero. Then there is a unique great- 
est common divisor d(x) of a(x) and b(x). Furthermore, there are (not neces- 
sarily unique) polynomials u(x) and v(x) such that d(x) = a(x)u(x) + b(x)v(x). 


Steps 1 and 2 of the proof are patterned after the proof of Theorem 1.2. 


Proof of Theorem 4.8 > Let S be the set of all linear combinations of ax) and 
b(x), that is, 
S = {a(x)m(x) + b(x)n(x) | m(x), nx) € Fp]. 
Step 1 Find a monic polynomial of smallest degree in S. 


Proof of Step 1: S contains nonzero polynomials (for instance, at least 
one of a(x)" Lg + B(x): Opor a(x) * Op + B(x) 1x). So the set of all 
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degrees of polynomials in S is a nonempty set of nonnegative integers, 
which has a smallest element by the Well-Ordering Axiom. Hence, there 
is a polynomial w(x) of smallest degree in S. If d is the leading coef- 
ficient of w(x), then (x) = d—w(x) is a monic polynomial of smallest 
degree in S. By the definition of S, 
t(x) = a(x)u(x) + b(x)(x) for some u(x), v(x) € Fix]. 

Step 2 Prove that t(x) is a gcd of a(x) and b(x). 
Proof of Step 2: We must prove that t satisfies the two conditions in the 
definition of the gcd: 


(1) #(x)| a(x) and t(x) |b); 
(2) If c(x)|a(x) and c(x) | d(x), then deg c(x) = deg t(x). 

Proof of (1): In the proof of Step 2 of Theorem 1.2, replace a, b, 
c, t, q, r, u, v, k, and s with a(x), B(x), c(x), (x), g0), rx), uO), 


v(x), k(x), and s(x), respectively, to show that t(x) is a common 
divisor of a(x) and A(x). 


Proof of (2): With the same replacements as in the proof of (1), 
repeat the proof of Step 2 of Theorem 1.2, until you reach this 
statement: 

t(x) = a(xju(x) + B(2)o(x) = [eeku + [e(2) s(x) u(x) 
= ex)[k(x)u(x) + s(x)(~)].- 
The first and last terms of this equation show that (x)| A(x). By 
Theorem 4.7, deg c(x) = deg t(x). 
This shows that ¢(x) is a greatest common divisor of f(x) and g(x). 


Step 3 Prove that t(x) is the unique gcd of a(x) and K(x). 


Proof of Step 3: Suppose that d(x) is any ged of a(x) and B(x). To prove 
uniqueness, we must show that d(x) = i(x). Since d(x) is a common divi- 
sor, we have a(x) = d(x)f(x) and (x) = dx)g(x) for some f(x), g(x) E F [x]. 


Therefore, 
t(x) = a(x)ulx) + boul) = [do ulx) + dodge) 
= Ax) f(x)u) + ge). 
By Theorem 4.2, 


deg t(x) = deg d(x) + deg [f(x)u(x) + g(x)u(~)]. 
Since they are gcd’s, t(x) and d(x) have the same degree. Hence, 


deg [/(x)u(x) + g(x)e()] = 0, 


so that f{x)u(x) + 9(x)e(x) = c for some constant c € F. Therefore, 

(x) = dx)c. Since both (x) and d(x) are monic, the leading coefficient 
on the left side is 1, and the leading coefficient on the right side is c. So 
we must have c = 1p Therefore, d(x) = t(x) = a(x)u(x) + b(x){x) is the 
unique gcd of a(x) and b(x). E 
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Corollary 4.9 


Let F be a field and a(x), b(x)ef[x], not both zero. A monic polynomial 
d(x) e Fix] is the greatest common divisor of a(x) and b(x) if and only if d(x) 
satisfies these conditions. 

(i) a(x) | a(x) and d(x) | b(x). 

(ii) if c(x) | a(x) and c(x) | (x), then c(x) | d(x). 


Proof» Adapt the proof of Corollary 1.3 to Fix]. E 


Polynomials f(x) and g(x) are said to be relatively prime if their greatest common 
divisor is 1p 


Theorem 4,10 


Let F be a field and a(x), b(x), c(x) € Fix]. If a(x) | b(x)c(x) and a(x) and b(x) are 
relatively prime, then a(x) |c(x). 


Proof» Adapt the proof of Theorem 1.4 to F[x]. m 


@ Exercises 


NOTE: F denotes a field. 
A. 1. If f(x) € F[x], show that every nonzero constant polynomial divides fx). 
2. If f(x) = cew” + +++ + cywith c, # Or, what is the gcd of f(x) and 07? 
3. If a,b€ Fand a + b, show that x + a and x + b are relatively prime in F[x]. 
4. (a) Let f(x), g(x) E Fix]. If fC) | g(x) and g(x) Lx), show that f(x) = cg(x) for 
some nonzero cE F. 
(b) If f(x) and g(x) in part (a) are monic, show that f(x) = g(x). 
5. The Euclidean Algorithm for finding gcd’s is described for integers in Exercise 15 
of Section 1.2. The process given there also works for polynomials over a 
field, with one minor adjustment. For integers, the last nonzero remainder is 
the ged. For polynomials the last nonzero remainder is a common divisor of 
highest degree, but it may not be monic. In that case, multiply it by the inverse 


of its leading coefficient to obtain the gcd. Use the Euclidean Algorithm to 
find the gcd of the given polynomials: 


(a) x4 — x? — 7° + Land x — 1 in Qfx] 

(b) £ + x4 + 2x3 — e — x — 2and it + 2x3 + 5x? + 4x + 4 in Q[x] 
(c) x4 + 32° + 2x +4 and x? — 1 in Zd 

(d) 4x4 + 2x3 + 6x? + 4x + 5 and 3x? + 5x? + 6x in Z,[x] 
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(e) Æ — ix? + 4x — 4i and x* + 1 in C[x] 
() xt+x+ land + x+ linZJx] 
(g) 2x* + 5x3 — 5x — 2 and 2x3 — 3x? — 2x in Q[x]. 
6. Expresseach of the gcd’s in Exercise 5 as a linear combination of the two 
polynomials, 
B. 7. Let f(x) € F[x] and assume that f(x)| g(x) for every nonconstant g(x) E€ F[x]. Show 
that f(x) is a constant polynomial. [Hint: f(x) must divide both x + 1 and x.] 


8. Let f(x), g(x) € F[x], not both zero, and let d(x) be their gcd. If A(x) is a 
common divisor of f(x) and g(x) of highest possible degree, then prove that 
h(x) = ed{x) for some nonzero cE F. 


9. If f(x) # Of and f(x) is relatively prime to Oz, what can be said about f(x)? 

10. Find the gcd of x + a + band x? — 3abx + @ + b in Q[x]. 

11. Fill in the details of the proof of Theorem 4.8. 

12. Prove Corollary 4.9. 

13. Prove Theorem 4.10. 

14. Let f(x), g(x), A(x) € F[x], with f(x) and g(x) relatively prime. If f(x) |A(x) and 
gix) |A(x), prove that f(x)g(x) | A(x). 

15. Let f(x), g(x), A(x) © F[x], with f(x) and g(x) relatively prime. If A(x) | f(x), 
prove that A(x) and g(x) are relatively prime 


16. Let f(x), g(x), h(x) € F[x], with f(x) and g(x) relatively prime. Prove that the 
gcd of f(x)h(x) and g(x) is the same as the gcd of A(x) and g(x). 


43 Irreducibles and Unique Factorization 


Throughout this section F always denotes a field. Before carrying over the results of 
Section 1.3 on unique factorization in Z to the ring F[x], we must first examine an area 
in which Z differs significantly from F[x]. In Z there are only two units,* namely +1, 
but a polynomial ring may have many more units (see Corollary 4.5). 

An element æ in a commutative ring with identity R is said to be an associate of an 
element bof R if a = bu for some unit u. In this case b is also an associate of æ because 
u`! is a unit and b = au™'. In the ring Z, the only associates of an integer n are n and 
—n because +1 are the only units. If F is a field, then by Corollary 4.5, the units in F[x] 
are the nonzero constants. Therefore, 


f(x) is an associate of g(x) in F [x] if and only if f(x) = eg(x) for some nonzero c E F. 


Recall that a nonzero integer p is prime in Z if it is not +1 (that is, p is not a unit 
in Z) and its only divisors are +1 (the units) and +p (the associates of p). In F[x] the 
units are the nonzero constants, which suggests the following definition. 


Unit” is defined just before Example 4 in Section 3.2. 
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Definition Let F be a field. A nonconstant polynomial p(x) =e Fix] is said to be 
irreducible* if its only divisors are its associates and the nonzero constant 
polynomials (units). A nonconstant polynomial that is not irreducible is 
said to be reducible. 


EXAMPLE 1 


The polynomial x + 2 is irreducible in Q[x] because, by Theorem 4.2, all its 
divisors must have degree 0 or 1. Divisors of degree 0 are nonzero constants. 
If f(x) | (sc + 2), say x + 2 = f(x)g(x), and if deg f(x) = 1, then g(x) has degree 
0, so that g(x) = c. Thus «(x + 2) = f(x), and f(x) is an associate of x + 2. A 
similar argument in the general case shows that 


every polynomial of degree 1 in F [x] is irreducible in F]x]. 


The definition of irreducibility is a natural generalization of the concept of primal- 
ity in Z. In most high-school texts, however, a polynomial is defined to be irreducible 
if it is not the product of polynomials of lower degree. The next theorem shows that 
these two definitions are equivalent. 


Theorem 4.11 


Let F be a field. A nonzero polynomial f(x) is reducible in F[x] if and only if f(x) 
can be written as the product of two polynomials of lower degree. 


Proof» First, assume that f(x) is reducible. Then it must have a divisor g(x) that 
is neither an associate nor a nonzero constant, say f(x) = g(x)h(x). If 
either g(x) or k(x) has the same degree as f(x), then the other must have 
degree 0 by Theorem 4.2. Since a polynomial of degree 0 is a nonzero 
constant in F, this means that either g(x) is a constant or an associate 
of f(x), contrary to hypothesis. Therefore, both g(x) and A(x) have lower 
degree than f(x). 

Now assume that f(x) can be written as the product of two polyno- 
mials of lower degree, and see Exercise 8. W 


Various other tests for irreducibility are presented in Sections 4.4 to 4.6. For now, 
we note that the concept of irreducibility is not an absolute one. For instance, x7 + 1 
is reducible in C[x] because x? + 1 = (x + i)(x — i) and neither factor is a constant or 
an associate of x? + 1. But x? + 1 is irreducible in Q[x] (Exercise 6). 

The following theorem shows that irreducibles in F[x] have essentially the same 
divisibility properties as do primes in Z. Condition (3) in the theorem is often used to 
prove that a polynomial is irreducible; in many books, (3) is given as the definition of 
“irreducible”. 


“You could just as well call such a polynomial “prime”, but “irreducible” is the customary term with 
polynomials. 
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Theorem 4,12 


Let F be a field and p(x) a nonconstant polynomial in F{x]. Then the following 
conditions are equivalent:* 


(1) p(x) is irreducible. 


(2) If B(x) and c(x) are any polynomials such that p(x) | b(x)e(x), then 
p(x) |6(x) or p(x) |c(x). 

(3) If r(x) and s(x) are any polynomials such that p(x) = r{x)s(x), then r(x) 
or s(x) is a nonzero constant polynomial. 


Proof» (1) = (2) Adapt the proof of Theorem 1.5 to F[x]. Replace statements 
about +p by statements about the associates of p(x); replace statements 
about +1 by statements about units (nonzero constant polynomials) in 
F[x]; use Theorem 4.10 in place of Theorem 1.4. 


(2) => (3) If p(x) = r{x)s(x), then p(x) | r(x) or p(x) |s), by (2). If 
P(x) |7(x), say r(x) = p(x)v(x), then p(x) = r(x)s(x) = p(x)u(x)s(x). Since 
F[x] is an integral domain, we can cancel p(x) by Theorem 3.7 and con- 
clude that 1p = u(x)s(x). Thus s(x) is a unit, and hence by Corollary 4.5, 
s(x) is a nonzero constant. A similar argument shows that if p(x)|s(x), 
then r{x) is a nonzero constant. 


(3) => (1) Let e(x) be any divisor of p(x), say p(x) = c(x)d(x). Then 
by (3), either c(x) is a nonzero constant or d(x) is a nonzero constant. If 
d(x) = d # 0, then multiplying both sides of p(x) = c(x)d(x) = de(x) by 
d~! shows that e(x) = d~'p(x). Thus in every case, ¢(x) is a nonzero con- 
stant or an associate of p(x). Therefore, p(x) is irreducible. m 


Corollary 4.13 


Let F be afield and p(x) an irreducible polynomial in Fix]. If p(x) |a; (ax) + + + A(X), 
then p(x) divides at least one of the a(x). 


Proof» Adapt the proof of Corollary 1.6 to Fix]. m 


Theorem 4,14 


Let F be a field. Every nonconstant polynomial f(x) in F[x] is a product of 
irreducible polynomials in F[x].' This factorization is unique in the following 
sense: If 


f(x) = POPX) + pdx) and f(x) = 94x) o(x) + + > q(x) 


*For the meaning of “the following conditions are equivalent” and what must be done to prove 
Theorem 4.12, see page 508 of Appendix A. Example 2 there is the integer analogue of Theorem 4.12. 


tWe allow the possibility of a product with just one factor in case f(x) is itself irreducible. 
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with each pfx) and q(x) irreducible, then f = s (that is, the number of irre- 
ducible factors is the same). After the q(x) are reordered and relabeled, if 
necessary, 


páx) is an associate of qx) G=1,2,3,...,7). 


Proof» To show that J(x) is a product of irreducibles, adapt the proof of 
Theorem 1.7 to F[x]: Let S be the set of all nonconstant polynomials 
that are not the product of irreducibles, and use a proof by contradiction 
to show that S is empty. To prove that this factorization is unique up to 
associates, suppose f(x) = py(x)p2(x) * * + P) = aax) +> * 9A) 
with each p(x) and g(x) irreducible. Then p,(x)[p2(x) - + > p(x) = 
qı(x)q(x) + + gfx), so that p,(x) divides g,(x)g2(x) - - - g{x). Corollary 
4.13 shows that p,(x)|9,(x) for some j. After rearranging and relabel- 
ing the q(x)’s if necessary, we may assume that p(x) | q(x). Since g;(x) 
is irreducible, p(x) must be either a constant or an associate of q(x). 
However, p(x) is irreducible, and so it is not a constant. Therefore, p,(x) 
is an associate of g,(x), with p(x) = cıqı(x) for some constant c,. Thus 


NMP Pax) «> + PEN = Pi)PAX) + + + PAX) = NOG) - + - GA). 
Canceling g,(x) on each end, we have 


Pep) + + * PA) = gax) + + + aA). 


Complete the argument by adapting the proof of Theorem 1.8 to F[x], 
replacing statements about +g; with statements about associates of 
q(x). m 


E Exercises 


NOTE: F denotes a field and p a positive prime integer. 
A. 1. Find a monic associate of 
(a) 3x3 + 2x? + x + 5 in Qix] (b) 3x° — 4x? + 1 in Zax] 
(e) ix + x -— 1 in Ch] 
2. Prove that every nonzero f(x) € F[x] has a unique monic associate in F[x]. 
3. List all associates of 
(a) ê+ x+ 1inZdxq (b) 3x + 2 in Z,[x] 
4. Show that a nonzero polynomial in Z,{x] has exactly p — 1 associates. 


5. Prove that f(x) and g(x) are associates in F[x] if and only if f(x) | g(x) and 
(x) E). 

6. Show that x7 + 1 is irreducible in Q[x]. [Hint: If not, it must factor as 
(ax + b)(ex + d) with a, b, e, d © Q; show that this is impossible.] 


7. Prove that /(x) is irreducible in F[x] if and only if each of its associates is 
irreducible. 
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8. 


22. 


24. 


If f(x) € F[x] can be written as the product of two polynomials of lower 
degree, prove that f(x) is reducible in F[x]. (This is the second part of the 
proof of Theorem 4.11.) 


. Find all irreducible polynomials of 


(a) degree 2 in Z,[x] (b) degree 3 in Z,[x] 
(c) degree 2 in Z,[x] 


. Is the given polynomial irreducible: 


(a) x? — 3 in Q[x]? In R[x]? 
(b) x7 + x — 2 in Z,[x]? In Z,[x]? 


. Show that x? — 3 is irreducible in Z,[x]. 
. Express x* — 4 as a product of irreducibles in Q[x], in R[x], and in C[x]. 
. Use unique factorization to find the ged in C[x] of (x ~ 3)(x — 4} (x ~ 0* 


and (x — I(x — 3\(x — 4)°. 


. Show that x? + x can be factored in two ways in Z,[x] as the product of non- 


constant polynomials that are not units and not associates of x or x + 1. 


. (a) By counting products of the form (x + a)(x + b), show that there are 


exactly (p° + p)/2 monic polynomials of degree 2 that are not irreducible in 
Z,[x]. 

(b) Show that there are exactly (p* — p)/2 monic irreducible polynomials of 
degree 2 in Z [x]. 


. Prove that p(x) is irreducible in F[x] if and only if for every g(x) € F[x], either 


P(x)| g(x) or p(x) is relatively prime to g(x). 


. Prove (1) = (2) in Theorem 4.12. 


. Without using statement (2), prove directly that statement (1) is equivalent to 


statement (3) in Theorem 4.12. 


. Prove Corollary 4.13. 
. If p(x) and g(x) are nonassociate irreducibles in F[x], prove that p(x) and q(x) 


are relatively prime. 


. (a) Find a polynomial of positive degree in Z,[x] that is a unit. 


(b) Show that every polynomial (except the constant polynomials 3 and 6) 
in Zo[x] can be written as the product of two polynomials of positive 
degree. 


(a) Show that x3 + a is reducible in Z,[x] for eacha E Z}. 
(b) Show that xê + ais reducible in Z.[x] for eacha E Zs. 


. (a) Show that x? + 2 is irreducible in Z dx]. 


(b) Factor x° — 4 as a product of irreducibles in Z.[-]. 


Prove Theorem 4.14. 
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25. Prove that every nonconstant f(x) E F[x] can be written in the form 
cp\(X)p(x) - «+ p,(x), with c € F and each p(x) monic irreducible in F[x]. 
Show further that if f(x) = dqi(x)go(x) ` ` ` ¢n(x) with d E F and each g(x) 
monic irreducible in F[x], then m = n, c = d, and after reordering and 
relabeling if necessary, p;(x) = g,(x) for each i. 


aa Polynomial Functions, Roots, and Reducibility 


In the parallel development of F[x] and Z, the next step is to consider criteria for 
irreducibility of polynomials (the analogue of primality testing for integers). Unlike 
the situation in the integers, there are a number of such criteria for polynomials whose 
implementation does not depend on a computer. Most of them are based on the fact 
that every polynomial in F[x] induces a function from F to F. The properties of this 
function (in particular, the places where it is zero) are closely related to the reducibility 
or irreducibility of the polynomial. 

Throughout this section, R is a commutative ring. Associated with each polynomial 
ap” + +++ + ax? + ax + ay in R[x] is a function fR — R whose rule is 

foreachre R, f(r) =a" +: +a + ar + ao. 


The function f induced by a polynomial in this way is called a polynomial function. 


EXAMPLE 1 


The polynomial x? + 5x + 3 € R[x] induces the function fR > R whose rule 
is f(r) = r? + Sr + 3 foreachr E R. 


EXAMPLE 2 
The polynomial x* + x +1 € Z,[x] induces the function f:Z, + Z, whose rule 
is f(r) = 4 + r + 1. Thus 
{OH =8+0+1=1, fA)=Ż+1+1=0, 
fQ=%A4+241=1. 
The polynomial x + x? + 1 € Z,[x] induces the function g:Z,— Z; given by 
g0)=O+0+1=1, gl)=P+P+1=0, 
a2)=2+24+1=1. 


Thus f and g are the same function on Z3, even though they are induced by 
different polynomials in Z;[x].* 


Although the distinction between a polynomial and the polynomial function it 
induces is clear, the customary notation is quite ambiguous. For example, you will see a 


“Remember that functions fand g are equal if Rr) = g(r) for every rin the domain. 
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statement such as f(x) = x* — 3x + 2. Depending on the context, f(x) might denote the 
polynomial x? — 3x + 2 €e R[x] or the rule of its induced function f: R —> R. The sym- 
bol x is being used in two different ways here. In the polynomial x? — 3x + 2, xis an 
indeterminate (transcendental element) of the ring R[x].* But in the polynomial func- 
tion f:R — R, the symbol x is used as a variable to describe the rule of the function. It 
might be better to use one symbol for an indeterminate and another for a variable, but 
the practice of using x for both is so widespread you may as well get used to it. 

The use of the same notation for both the polynomial and its induced function also 
affects the language that is used. For instance, one says “evaluate the polynomial 
3x? — 5x + 4at x = 2” or “substitute x = 2 in 3x? — 5x + 4” when what is really meant 
is “find f (2) when fis the function induced by the polynomial 3x? — 5x + 4”. 

The truth or falsity of certain statements depends on whether x is treated as an 
indeterminate or a variable. For instance, in the ring R[x], where x is an indetermi- 
nate (special element of the ring), the statement x? — 3x + 2 = 0 is false because, by 
Theorem 4.1, a polynomial is zero if and only if all its coefficients are zero. When x is a 
variable, however, as in the rule of the polynomial function f(x) = x? — 3x + 2, things 
are different. Here it is perfectly reasonable to ask which elements of R are mapped to 0 
by the function f, that is, for which values of the variable x is it true that x? — 3x + 2 = 0. It 
may help to remember that statements about the variable x occur in the ring R, whereas 
statements about the indeterminate x occur in the polynomial ring R[x]. 


Roots of Polynomials 


Questions about the reducibility of a polynomial can sometimes be answered by 
considering its induced polynomial function. The key to this analysis is the concept 
of a root. 


Definition Let R be a commutative ring and f(x) © Ax]. An element a of R is said to 
be a root {or zero) of the polynomial ftx} if f(a) = Og, that is, if the induced 


function fR —> R maps @ to Og, 


EXAMPLE 3 


The roots of the polynomial f(x) = x? — 3x + 2 € R[x] are the values of the 
variable x for which f(x) = 0, that is, the solutions of the equation x” — 3x + 2 = 0. 
It is easy to see that the roots are | and 2. 


EXAMPLE 4 


The polynomial x? + 1 € R[x] has no roots in R because there are no real- 
number solutions of the equation x? + 1 = 0. However, if x? + 1 is considered 
as a polynomial in C[x], then it has i and —i as roots because these are the 
solutions in € of x? + 1 = 0. 


*See page 550 in Appendix G for more information. 
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Theorem 4.15 The Remainder Theorem 


Let F be a field, f(x) © Fix], and a e F. The remainder when f(x) is divided by 
the polynomial x — a is f(a). 


EXAMPLE 5 


To find the remainder when f(x) = x” + 3x” + 5 is divided by x — 1, we apply 
the Remainder Theorem with a = 1. The remainder is 


A) = 19 4+3-1%+5=14+34+5=9. 
EXAMPLE 6 


To find the remainder when f(x) = 3x‘ — 8x7 + 11x + 1 is divided by x + 2, we 
apply the Remainder Theorem carefully. The divisor in the theorem is x — a, 


not x + a. So we rewrite x + 2 as x — (—2) and apply the Remainder Theorem 
with a = —2. The remainder is 


fD = 3(—2)* — 8(—2)? + 11(—2) + 1 = 48 — 32 — 22 + 1 = —5. 


Proof of Theorem 4.15> By the Division Algorithm, f(x) = (x — a)q(x) + r(x), 
where the remainder r(x) either is 07 or has smaller degree than the 
divisor x — a. Thus deg r(x) = 0 or x(x) = Op. In either case, r(x). = e for 
somec E F. Hence, f(x) = (x — a)q(x) + c, so that f(a) = (a — a)g(a) + 
c=O0;+c=c. E 


Theorem 4.16 The Factor Theorem 


Let F be a field, f(x) € [x], and a e F. Then a is a root of the polynomial f(x) 
if and only if x — ais a factor of f(x) in F{x]. 


Proof» First assume that a is a root of fx). Then we have 
S(x) = (x — a)q(x) + r(x) [Division Algorithm] 
fœ) = (x — a)q(x) + fla) [Remainder Theorem) 
f(x) = (x — a)q(x) [a is a root of f(x), so f(a) = 07] 
Therefore, x — a is a factor of f(x). 


Conversely, assume that x — a is a factor of f(x), say f(x) = (x — a)g(x). 
Then a is a root of f(x) because f(a) = (a — a)g(a) = Org(a) = Op E 
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EXAMPLE 7 


To show that xX? — x5 + 2x4 — 3x? — x + 2 is reducible in Q[x], note that lisa 
root of this polynomial. Therefore, x — 1 is a factor. 


Corollary 4.17 


Let F be a field and f(x) a nonzero polynomial of degree n in F[x]. Then f(x) has 
at most n roots in F. 


Proof" > 1f f(x) has a root a, in F, then by the Factor Theorem, f(x) = (x — a;)h,(x) 
for some h(x) € Fix]. If h(x) has a root a, in F, then by the Factor 
Theorem 


fx) = (x — a(x — a(x) for some k(x) € Fix]. 


If A,(x) has a root a; in F, repeat this procedure and continue doing so 
until you reach one of these situations: 


(1) f(x) = (x — a(x — a) + + + Ge — anbhy(>) 
(2) f(x) = (x — a(x — a) ++ + (x — a,)Ay{x) and h(x) has no 
root in F. 
In Case (1), by Theorem 4.2, we have 


deg f(x) = deg(x — a;) + deg(x — ag) + +++ + deg(x — ay) + deg h,(x) 
n=1+14+++:+1+4 deg (x) 
=n + deg h,(x) 
Thus, deg h,(x) = 0, so h,(x) = c for some constant ¢ € F and f(x) 
factors as 


F(x) = dx — ay)(x — a) + + +(x% — Gy). 


Clearly, the n numbers a), a, . . . , 4, are the only roots of f(x). 

The argument in Case (2) is essentially the same (just replace n by k) 
and leads to this conclusion: n = deg f(x) = k + deg A,{x). So the num- 
ber of roots iskand k =< n. E 


Corollary 4.18 


Let F be a field and f(x) e F[x], with deg f(x) = 2. If f(x) is irreducible in F[x], 
then f(x} has no roots in F. 


Proof» 1f f(x) is irreducible, then it has no factor of the form x — ain F[x]. 
Therefore, f(x) has no roots in F by the Factor Theorem. W 


“lf you prefer a proof by induction, see Exercise 29. 
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The converse of Corollary 4.18 is false in general. For example, x* + 2x7 + 1 = 
QÊ + D(x? + 1) has no roots in Q but is reducible in Q[x]. However, the converse is 
true for degrees 2 and 3. 


Corollary 4.19 


Let F be a field and let f(x} € F[x] be a polynomial of degree 2 or 3. Then f(x) is 
irreducible in F[x] if and only if f(x) has no roots in F. 


Proof » Suppose f(x) is irreducible. Then f(x) has no roots in F by Corollary 4.18. 
Conversely, suppose that f(x) has no roots in F. Then f(x) has no first- 
degree factor in F[x] because every first-degree polynomial cx + din F[x] 
has a root in F, namely —c~'¢. Therefore, if f(x) = r(x)s(x), neither r(x) 
nor s(x) has degree 1. By Theorem 4.2, deg f(x) = deg x) + deg s(x). 
Since f(x) has degree 2 or 3, the only possibilities for (deg r(x), deg s(x)) 
are (2, 0) or (0, 2) and (3, 0) or (0, 3). So either r(x) or s(x) must have 
degree 0, that is, either r(x) or s(x) is a nonzero constant. Hence, f(x) is 
irreducible by Theorem 4.12. W 


EXAMPLE 7 


To show that x? + x + 1 is irreducible in Z; [x], you need only verify that none 
of 0, 1,2, 3,4 € Zs is a root. 


We close this section by returning to its starting point, polynomial functions. 
Example 2 shows that two different polynomials in F[x] may induce the same function 
from F to F. We now see that this cannot occur if Fis infinite. 


Corollary 4.20 


Let F be an infinite field and f(x), g(x) © Fix]. Then f(x) and g(x) induce the 
same function from F to F if and only if f(x) = g(x) in ALx]). 


Proof » Suppose that f(x) and g(x) induce the same function from F to F. Then 
f(a) = ga), so that f(a) — g(a) = Op for every a E F. This means that 
every element of F is a root of the polynomial f(x) — g(x). Since F is 
infinite, this is impossible by Corollary 4.17 unless f(x) — g(x) is the 
zero polynomial, that is, (x) = g(x). The converse is obvious. W 


E Exercises 


NOTE: F denotes a field. 
A. 1. (a) Find a nonzero polynomial in Z,[x] that induces the zero function on Z}. 
(b) Do the same in Z,[x]. 
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2. Find the remainder when /{x) is divided by g(x): 
(a) f(x) = x! + x£ and g(x) = x — 1 in Q[x] 
(b) f(x) = 2x5 — 3x4 + æ — 2x7 + x — 8 and g(x) = x — 10 in Q[x] 
(©) f(x) = 10x75 — 8x + 6x45 + 4x — 2x!5 + 5 and a(x) =x + 1 in Q[x] 
(d) f(x) = 2x5 — 3x4 + x3 + 2x + 3 and g(x) = x — 3 in Z,[x] 
3. Determine if h(x) isa factor of f(x): 
(a) A(x) = x + 2 and f(x) = 28 — 3x? — 4x — 12 in R[x] 


(b) Wx) = x — Sand fx) = 2x4 +23 +x ~ inp] 


© A(x) = x + 2and f(x) = 39° + 4x* + 2x) — x@+ 2x + lin Zax] 
(d) A(x) = x — 3 and f(x) = x — $ + x — 5 in Z [x] 

4. (a) For what value of k is.x.— 2 a factor of x* — 5x° + 5x? + 3x + kin Q[x]? 
(b) For what value of k is x-+ 1 a factor of x‘: 2x — 3x? + kx + 1 in Zs[x]? 


. Show that x — 1pdivides.a,x" + » ++ + a,x? +-a,x + agin F[x] if and only if 
atata, +t +a = 0p 


6. (a) Verify that every element of Z, is a root of »* — x € Z,x]. 


a] 


(b) Verify that every element of Z, is a root of x5 — x € ZJx]. 
(c) Make a conjecture about the roots of x” — x € Z,[x] (p prime). 


7. Use the Factor Theorem to show that x’ — x factors in Z,[x] as 
x(x — IXx — 2)(x — 3)(x — 4)\(x — 5)(x — 6), without doing any polynomial 
multiplication. 


8. Determine if the given polynomial is irreducible: 

(a) 7 —7inR[x] = (b) x? — 7 in Qix] 

(c) 2 + 7 in CE (d) 2x? + x? + 2x + 2 in Z,[x] 

(e) © -—9inZy[x] (© x4 +x + 1LinZ,[x] 
9. List all monic irreducible polynomials of degree 2 in Z,[x]. Do the same in Z,[x]. 
10. Find a prime p > 5 such that x? + 1 is reducible in Z,[x}. 


11. Find an odd prime p for which x — 2 is a divisor of x* + x° + 32 + x + 1 in 
Zd]. 
B. 12. If a e Fis a nonzero root of cp” + & pe! + +++ + ex + cg E Fix], show 
that a7! is a root of eg” + cx) + +++ + Cp x + Cy 
13. (a) If f(x) and g(x) are associates in F[x], show that they have the same roots 
in F 
(b) If f(x), g(x) € Fiq] have the same roots in F, are they associates in F[x]? 


14. (a) Suppose r,s € Fare roots of ax? + bx + c € F[x] (with a + 07). Use the 
Factor Theorem to show that r + s = —a7!b and rs = a™e. 
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(b) Suppose r, s, t € Fare roots of ax} + bx? + ex + d e F[x] (with a + 0p). 
Show that r + s + t = —a~'band rs + st + rt = ae and rst = —a'd. 


Nn 


. Prove that x? + 1 is reducible in Z,[x] if and only if there exist integers a and b 
such that p = a + band ab = | (mod p). 


. Let f(x), g(x) € F[x] have degree = n and let co, c1, . . . , €n be distinct elements 
of F. If f(e) = g(e) for i = 0, 1, ..., n, prove that f(x) = g(x) in Fx]. 


17. Find a polynomial of degree 2 in Z,[x] that has four roots in Z,. Does this 
contradict Corollary 4.17? 


a 


18. Let g:€ — C be an isomorphism of rings such that g(a) = a for each 
a E€ Q. Suppose r € C isa root of f(x) € Q[x]. Prove that g(r) is also a 
root of f(x). 


19. We say that a € Fis a multiple root of f(x) € F[x]if (x — a¥ is a factor of 
f(x) for some k = 2. 


(a) Prove that a e R is a multiple root of f(x) € R[x] if and only if aisa 
root of both f(x) and f'(x), where f'(x) is the derivative of f(x). 
(b) If f(x) € R[x] and if f(x) is relatively prime to f(x), prove that f(x) has 
no multiple root in R. 
20. Let R be an integral domain. Then the Division Algorithm holds in R[x] 


whenever the divisor is monic, by Exercise 14 in Section 4.1. Use this fact to 
show that the Remainder and Factor Theorems hold in R[x]. 


2 


. If R is an integral domain and f(x) isa nonzero polynomial of degree n in 
R[x], prove that f(x) has at most n roots in R. [Hint: Exercise 20.] 


22. Show that Corollary 4.20 holds if Fis an infinite integral domain. [Hint: See 
Exercise 21.] 


23. Let f(x), g(x), A(x) € F[x]and r E F. 
(a) If f(x) = g(x) + A(x) in F[x], show that f(r) = g(r) + A(r) in F. 
(b) If f(x) = g(x)A(x) in F[x], show that f(r) = g(r)A(r) in F. 
Where were these facts used in this section? 


24. Let a be a fixed element of F and define a map ¢,:F[x] > F by g f(x)] = f(a). 
Prove that ¢, is a surjective homomorphism of rings. The map @, is called an 
evaluation homomorphism; there is one for each a E F. 


25. Let ©[7] be the set of all real numbers of the form 
rota +r t- + an", withn=O0andr, € Q. 
(a) Show that Q[7] is a subring of R. 


(b) Show that the function 6:Q[x] —> Q[7] defined by 0( f(x)) = f(T) is an 
isomorphism. You may assume the following nontrivial fact: 7 is not 
the root of any nonzero polynomial with rational coefficients. Therefore, 
Theorem 4.1 is true with R = Q and 7 in place of x. However, see 
Exercise 26. 
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26. Let Q[V2] be the set of all real numbers of the form 
ro + V2 +r AVD? +--+ + 7,(V2)", withn = Oandr, € Q. 
(a) Show that Q[ V3] is a subring of R. 


(b) Show that the function 8:Q[x] > Q[ V2] defined by 6(/(x)) = /(V2) isa 
surjective homomorphism, but not an isomorphism. Thus Theorem 4.1 is 
not true with R = Q and V2 in place of x. Compare this with Exercise 25. 
27. Let T be the set of all polynomial functions from F to F. Show that Tis a 
commutative ring with identity, with operations defined as in calculus: For 
eachr E F, 
CHSK +g) and (JEn) = fet). 
[Hint: To show that T is closed under addition and multiplication, use 


Exercise 23 to verify that f + g and fg are the polynomial functions induced 
by the sum and product polynomials f(x) + g(x) and f(x)g(x), respectively.] 


28. Let T be the ring of all polynomial functions from Z, to Z, (see Exercise 27). 


(a) Show that T is a finite ring with zero divisors. [Hint: Consider f(x) = x + 1 
and g(x) = xX? + 2x] 
(b) Show that T cannot possibly be isomorphic to Z;[x]. Then see Exercise 30. 
29. Use mathematical induction to prove Corollary 4.17. 

C. 30. If Fis an infinite field, prove that the polynomial ring F[x] is isomorphic to 
the ring T of all polynomial functions from F to F (Exercise 27). [Hint: Define 
a map ¢:F[x] —> T by assigning to each polynomial f(x) € F[x] its induced 
function in 7; ¢ is injective by Corollary 4.20.] 


31. Let g:F[x] > F[x] be an isomorphism such that g(a) = a for every a E F. 
Prove that f(x) is irreducible in F[x] if and only if p(/(x)) is. 


32. (a) Show that the map ¢:F[x] > F[x] given by o( /(x)) = f(x + 1p) is an 
isomorphism such that (a) = a foreverya E F. 


(b) Use Exercise 31 to show that f(x) is irreducible in F[x] if and only if 
Sx + 1,) is. 


45 Irreducibility in Q[x]* 


The central theme of this section is that factoring in Q[x] can be reduced to factoring 
in Z[x]. Then elementary number theory can be used to check polynomials with inte- 
ger coefficients for irreducibility. We begin by noting a fact that will be used frequently: 


If f(x) e Qx], then cf(x) has integer 
coefficients for some nonzero integer e. 


*This section is used only in Chapters 11, 12, and 15. it may be omitted until then, if desired. Section 4.6 
is independent of this section. 
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For example, consider 
2, 3 1 
=x) — — - >= 
f(x) = 3x8 + 3" + rol z 


The least common denominator of the coefficients of f(x) is 12, and 12/(x) has integer 
coefficients: 


tape) = 12st + Fat + 3? — 2 = 12% + 8x4 + 9x3 — 2. 


According to the Factor Theorem, finding first-degree factors of a polynomial 
a(x) € Q[x] is equivalent to finding the roots of g(x) in Q. Now, g(x) has the same 
roots as cg(x) for any nonzero constant c. When c is chosen so that cg(x) has integer 
coefficients, we can find the roots of g(x) by using 


Theorem 4.21 Rational Root Test 


Let Aix) = a,x" + a,x"? + +++ + ax + a bea polynomial with integer coef- 
ficients. If r # 0 and the rational number r/s (in lowest terms) is a root of f(x), 
then r|a and s|a,. 


Proof» First consider the case when s = 1, that is, the case when the integer r 
is a root of f(x), which means that apr” + ar"! + +++ + ar + a =0. 
Hence, 


which says that r divides ap. 
In the general case, we use essentially the same strategy. Since r/s is a 
root of f(x), we have 


of) a) le) va 


We need an equation involving only integers (as in the case when s = 1). 
So multiply both sides by s", rearrange, and factor as before: 


Of" + apas +++ + as yr + ag’ = 0 
(+) ag” = -a — a, — +++ — ate 
Oy? = rag! — a, sr? — ++ — ast). 
This last equation says that r divides ays", which is not quite what we 
want. However, since r/s is in lowest terms, we have (r, s) = 1. It follows 
that (r, s") = 1 (a prime that divides s" also divides s, by Corollary 1.6). 
Since r| aos" and (r, s") = 1, Theorem 1.4 shows that r| a. A similar argu- 


ment proves that s |a, (just rearrange Equation (+) so that 4,7” is on one 
side and everything else is on the other side). E 
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EXAMPLE 1 


The possible roots in Q of f(x) = 2x4 + x3 — 21x? — 14x+ 12 are of the form 
r/s, where r is one of +1, +2, +3, +4, +6, or +12 (the divisors of the constant 
term, 12) and s is +1 or +2 (the divisors of the leading coefficient, 2). Hence, 
the Rational Root Test reduces the search for roots of f(x) to this finite list of 
possibilities: 
T 13 8 
1, a A Tags » 4, g 6 —6, 1254 PARE Oy ee 
2, 3 A, 6, 2 P PP 7 
It is tedious but straightforward to substitute each of these in f(x) to find that —3 
1 
and > are the only roots of f(x) in Q.* By the Factor Theorem, both x — (—3) = 


x + 3and x — 3 are factors of f(x). Division shows that 


f(x) = (x + a(x a = 4x — 8). 


The quadratic formula shows that the roots of 2x* — 4x — 8 are 1 + V5, 
neither of which is in Q. Therefore, 2x” — 4x — 8 is irreducible in Q[x] by 
Corollary 4.19. Hence, we have factored f(x) as a product of irreducible poly- 
nomials in Q[x]. 


EXAMPLE 2 


The only possible roots of g(x) = xè + 4x? + x — 1in Q are 1 and —1 (Why?). 
Verify that neither 1 nor —1 is a root of g(x). Hence g(x) is irreducible in Q[x] 
by Corollary 4.19. 


If f(x) € Q[x], then cf(x) has integer coefficients for some nonzero integer c. Any 
factorization of ef(x) in Z[x] leads to factorization of f(x) in Q[x]. So it appears that 
tests for irreducibility in Q[x] can be restricted to polynomials with integer coefficients. 
However, we must first rule out the possibility that a polynomial with integer coeffi- 
cients could factor in Q[x] but not in Z[x]. In order to do this, we need 


Lemma 4.22 


Let f(x), g(x), A(x) © Z[x] with fx) = godh(x). If p is a prime that divides every 
coefficient of f(x), then either p divides every coefficient of g(x) or p divides 
every coefficient of A(x) 


*A graphing calculator will reduce the amount of computation significantly. Since the x-intercepts of 
the graph of y = f(x) are the roots of ftx), you can eliminate any numbers from the list that aren't near 


1 3 
— and ——. 


an intercept. In this case, the graph indicates that you need only check —3, 2 7 
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Proof > Let (x) = dy tax +++ + apt, g(x) = by + bx +--+ + bu", and 
h(x) = co + cyx + +++ + c,x". We use a proof by contradiction. If the 
lemma is false, then p does not divide some coefficient of g(x) and some 
coefficient of A(x). Let b, be the first coefficient of g(x) that is not divis- 
ible by p, and let c, be the first coefficient of A(x) that is not divisible by 
P. Then p|b, for i< rand p|c; for j < t. Consider the coefficient a,, of 
f(x). Since fix) = g(x)h(x), 

appi = Byte to! + Batti + Bey + Barc to °* + Bey 
Consequently, 
De, = Oey, — [Boras + °° + Oates] — [b416 tees + brcal- 


Now, p |a: by hypothesis. Also, p divides each term in the first pair of 
brackets because r was chosen so that p |b; for each i < r. Similarly, p 
divides each term in the second pair of brackets because p |c; for each 
j < t. Since p divides every term on the right side, we see that p | b,c,- 
Therefore, p |b, or p |c; by Theorem 1.5. This contradicts the fact that 
neither b, nor c, is divisible by p. E 


Theorem 4,23 


Let f(x) be a polynomial with integer coefficients. Then f(x) factors as a prod- 
uct of polynomials of degrees m and n in Q[x] if and only if f(x) factors as a 
product of polynomials of degrees m and n in Z[x]. 


Proof » Obviously, if A(x) factors in Z[>], it factors in Q[x]. Conversely, suppose 
F(X) = h(x) in Q[x]. Let c and d be nonzero integers such that cg(x) 
and dh(x) haveinteger coefficients. Then cdf(x) = [cg(x)][dh(x)] in Z[x] 
with deg cg(x) = deg g(x) and deg dh(x) = deg h(x). Let p be any prime 
divisor of cd, say cd = pt. Then p divides every coefficient of the polyno- 
mial edf(x). By Lemma 4.22, p divides either every coefficient of cg(x) 
or every coefficient of dh(x), say the former. Then cg(x) = pk(x) with 
k(x) € Z[x] and deg k(x) = deg g(x). Therefore, pif(x) = edf(x) = 
[eg(x)][dh(x)] = [pk(x)][dh(x)]. Canceling p on each end, we have 
tf(x) = k(x)[dh(x)] in Z[x]. 

Now repeat the same argument with any prime divisor of t and cancel 
that prime from both sides of the equation. Continue until every prime 
factor of cd has been canceled. Then the left side of the equation will be 
+ f(x), and the right side will be a product of two polynomials in Z[x], 
one with the same degree as g(x) and one with the same degree as k(x). E 


EXAMPLE 3 


We claim that f(x) = x* — 5x’ + 1 is irreducible in Q[x]. The proof is by con- 
tradiction. If f(x) is reducible, it can be factored as the product of two noncon- 
stant polynomials in Q[x]. If either of these factors has degree 1, then f(x) has 
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a root in Q. But the Rational Root Test shows that f(x) has no roots in Q. (The 
only possibilities are +1, and neither is a root.) Thus if f(x) is reducible, the 
only possible factorization is as a product of two quadratics, by Theorem 4.2. 
In this case Theorem 4.23 shows that there is such a factorization in Z[x]. 
Furthermore, there is a factorization as a product of monic quadratics in Z[x] 
by Exercise 10, say 


(x? + ax + b)(x7 + cx +d) = x4 - 5x7 +1 
with a, b, c,d € Z. Multiplying out the left-hand side, we have 


xt + (a+ c)x? + (ac +b + dye + (be + ad)x + bd 
= x4 + 0x — 5x7 + Ox + 1. 


Equal polynomials have equal coefficients; hence, 
a+c=0 actb+d=—-5 betad=0 bd = 1. 
Since a + c = 0, we have a = —¢, so that 


-S=actbt+d=-c+b+d, 


or, equivalently, 
S=c-b-d. 
However, bd = 1 in Z implies that b = d= 1 orb = d= —1, and so there are only 
these two possibilities: 
S5=A-1-1 o S=e+1+1 
7=¢ 3=e7, 


There is no integer whose square is 3 or 7, and so a factorization of f(x) asa 
product of quadratics in #[x], and, hence in Q[x], is impossible. Therefore, f(x) 
is irreducible in Qf[x]. 


The brute-force methods of the preceding example are less effective for polynomi- 
als of high degree because the system of equations that must be solved is complicated 
and difficult to handle in a systematic way. However, the irreducibility of certain poly- 
nomials of high degree is easily established by 


Theorem 4.24 Eisenstein’s Criterion 


Let f(x) = apr? + +++ + ax + a be a nonconstant polynomial with integer 
coefficients. If there is a prime p such that p divides each of ap, a1, + + + ı 8n-1 
but p does not divide a, and p° does not divide a), then f(x) is irreducible in Q[x]. 


Proof> The proof is by contradiction. If f(x) is reducible, then by Theorem 4.23 
it can be factored in Z[x], say 


SC) = (bo + bix ++ + + b, X Xeo + yx + +++ + ex, 
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where each b, c; E Z, r = 1, and s = 1. Note that ag = baco. By hypothe- 
sis, p |a and, hence, p | bg or p|% by Theorem 1.5, say p | bọ. Since p? does 
not divide ap, we see that c is not divisible by p. We also have a, = 5,«,. 
Consequently, p does not divide b, (otherwise a, would be divisible by p, 
contrary to hypothesis). There may be other b, not divisible by p as well. 
Let b; be the first of the 5, not divisible by p; then 0 < k = r < n and 


plbfori<k and přbę 
By the rules of polynomial multiplication, 
ap = boCp + Dycy_y +++ + br-e + Deco, 
so that 
Dreo = ag — bacy — birai — +> + bgt: 


Since p |a;, and p |b, for i < k, we see that p divides every term on the 
right-hand side of this equation. Hence, p |bgco. By Theorem 1.5, p must 
divide b or co. This contradicts the fact that neither bx nor co is divisible 
by p. Therefore, f(x) is irreducible in Q[x]. m 


EXAMPLE 4 


The polynomial x!? + 6x!3 — 15x‘ + 3x? — 9x + 12 is irreducible in Q[x] by 
Kisenstein’s Criterion with p = 3. 


EXAMPLE 5 


The polynomial x? + 5 is irreducible in Q[x] by Eisenstein’s Criterion with 
p = 5. Similarly, x” + 5 is irreducible in Q[x] for each n = 1. Thus 


there are irreducible polynomials of every degree in Q|x]. 


Although Eisenstein’s Criterion is very efficient, there are many polynomials to 
which it cannot be applied. In such cases other techniques are necessary. One such 
method involves reducing a polynomial mod p, in the following sense. Let p be a posi- 
tive pane For each integer a, let [a] denote the congruence class of a in Z,, If f(x) = 
at tee + axt % is a polynomial with integer coefficients, let f(x) rains the 
hes [ah + --~ + [aJe + [a] in Z [x]. For instance, if f(x) = 2x4 — 3x7 + 
5x + 7 in [x], then in Zi, 


Fœ) = [4 — [3]x? + [Sx + [7] 
= [2x — [0]? + [2]x + [1] = [2]x* + [2]x + [1]. 
Notice that f(x) and f(x) have the same degree. This will always be the case 


when the leading coefficient of f(x) is not divisible by p (so that the leading 
coefficient of f(x) will not be the zero class in Z,). 
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Theorem 4,25 


Let f(x) = ayé + - + - + ax + a be a polynomial with integer coefficients, and 
let p be a positive prime that does not divide a,. If f(x) is irreducible in Zx), 
then f(x) is irreducible in Q[x]. 


Proof» Suppose, on the contrary, that f(x) is reducible in Q[x]. Then by 
Theorem 4.23, f(x) = g(x)A(x) with g(x), A(x) nonconstant polynomials 
in £[x]. Since p does not divide az, the leading coefficient of f(x), it 
cannot divide the leading coefficients of g(x) or A(x) (whose product is 
a,). Consequently, deg g(x) = deg g(x) and deg A(x) = deg A(x). In par- 
ticular, neither g(x) nor A(x) is a constant polynomial in Zx]. 

Verify that f(x) = 9(x)h(x) in Z[x] implies that f(x) = g(x)A(x) in 
Z [x] (Exercise 20). This contradicts the irreducibility of f(x) in £,[x]. 
Therefore, f(x) must be irreducible in Q[x]. m 


The usefulness of Theorem 4.25 depends on this fact: For each nonnegative in- 
teger k, there are only finitely many polynomials of degree k in Z, [x] (Exercise 17). 
Therefore, it is always possible, in theory, to determine whether a given polynomial in 
Z,{x] is irreducible by checking the finite number of possible factors. Depending on 
the size of p and on the degree of /{x), this can often be done in a reasonable amount 
of time. 


EXAMPLE 6 


To show that f(x) = xê + 8x4 + 3x? + 4x + 7 is irreducible in Q[x], we reduce 
mod 2. In Z{x], f(x) = x° + x + 1.* It is easy to see that f(x) has no roots in 
Z, and hence no first-degree factors in Z,[x]. The only quadratic polynomials in 
Z.[x] are x’, x2 + x, x* + 1, and x* + x + 1. However, if x?, x7 + x = x(x +1), 
or x? + 1 = (x + 1)(x + 1) were a factor, then f(x) would have a first-degree 
factor, which it doesn’t. You can use division to show that the remaining qua- 
dratic, x? + x + 1, is not a factor of f(x). Finally, f(x) cannot have a factor 
of degree 3 or 4 (if it did, the other factor would have degree 2 or 1, which is 
impossible). Therefore, f(x) is irreducible in Z,[x]. Hence, f(x) is irreducible 

in Q[x]. 


CAUTION: If a polynomial in Z[x] reduces mod p to a polynomial that 
is reducible in Z [x], then no conclusion can be drawn from 
Theorem 4.25. Unfortunately, there may be many p for 
which the reduction of f(x) is reducible in Z,[], even when 
fœ) is actually irreducible in Q[x]. Consequently, it may 
take more time to apply Theorem 4.25 than is first apparent. 


*When no contusion is likely, we omit the brackets for elements of Z}. 
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E Exercises 


A. 1. 


B.11. 
. Let F be a field and f(x) € F[x]. If e e Fand f(x + o) is irreducible in F[x], 


Use the Rational Root Test to write each polynomial as a product of irreduc- 
ible polynomials in Q[x]: 


(a) —x4 4224+ 27% +x4+2 (b) x$ + 4x4 + x3 — x? 
(c) 3x° +24 — 7x3 4 237 (d) 2x* — 5x + 3x7 + 4x —6 
(e) 2x* + 723+ 5x24 7x + 3 (£) 6x* — 31x? + 25x + 33x + 7 


. Show that Vp is irrational for every positive prime integer p. [Hint: What are 


the roots of x* — p? Do you prefer this proof to the one in Exercises 30 and 31 
of Section 1.37] 


. If a monic polynomial with integer coefficients has a root in Q, show that this 


root must be an integer. 


. Show that each polynomial is irreducible in Q[x], as in Example 3. 


(a) x4+2x3+x41 (b) x4 — 2x? + 8x + 1 


. Use Eisenstein’s Criterion to show that each polynomial is irreducible in Q[x]: 


(a) x° — 4x + 22 (b) 10 — 15x + 25x? — 7x* 
(c) 5x!! — 6x* + 12x3 + 36x — 6 


. Show that there are infinitely many integers k such that x + 12x3 — 21x +k 


is irreducible in Q[x]. 


. Show that each polynomial f(x) is irreducible in Q[x] by finding a prime p 


such that f(x) is irreducible in Z,[>] 
(a) 7x? + 6x7 + 4x +6 (b) 9x4 + 4x3 — 3x47 


. Give an example of a polynomial f(x) € Z[x] and a prime p such that f(x) 


is reducible in Q[x] but f(x) is irreducible in Z,[x]. Does this contradict 
Theorem 4.25? 


. Give an example of a polynomial in Z[x] that is irreducible in Q[x] but factors 


when reduced mod 2, 3, 4, and 5. 


. If a monic polynomial with integer coefficients factors in Z[x] as a product of 


polynomials of degrees m and n, prove that it can be factored as a product of 
monic polynomials of degrees m and n in Z[x]. 


Prove that 30x" — 91 (where n E Z, n > 1) has no roots in Q. 


prove that f(x) is irreducible in F[x]. [Hint: Prove the contrapositive.] 


. Prove that f(x) = x* + 4x + 1 is irreducible in Q[x] by using Eisenstein’s 


Criterion to show that f(x + 1) is irreducible and applying Exercise 12. 


. Prove that f(x) = x4 + x? + x? + x + lis irreducible in Q[x]. [Hint: Use the 


hint for Exercise 21 with p = 5.] 


. Let f(x) = ap” + ayy! + +++ + ax + a be a polynomial with integer 


coefficients. If p is a prime such that p | a4, p |a, . . - , p |a, but p } ay and 
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P’ X an, prove that f(x) is irreducible in Q[x]. [Hint: Let y = 1/x in f(x)/2’; the 
resulting polynomial is irreducible, by Theorem 4.24.] 


16. Show by example that this statement is false: If f(x) € Z[x] and there is no 
prime p satisfying the hypotheses of Theorem 4.24, then f(x) is reducible in O[>x]. 


17. Show that there are “+! — x* polynomials of degree k in Z,[x]. 

18. Which of these polynomials are irreducible in Q[x]: 
(a) 4-3? +1 O) A +x4+1 
(c) P+ 4x44 24 3x7-x+5 (d O4+5x7°4+4x4+7 

19. Write each polynomial as a product of irreducible polynomials in Q[x]. 
(a) x° + 2x‘ — 6x7 — 16x — 8 (b) x’ — 2x5 — 6x4 — 15x? — 33x — 9 


20. If f(x) = a,x" + +++ + ax + ap, B(x) = Bp? + +++ + bix + bo and A(x) = 
epi tess +axt+a are polynomials in Z[x] such that f(x) = g(x)A(x), show 
that in Z,[x], f(x) = 2(x)A(x). Also, see Exercise 19 in Section 4.1. 


C.21. Prove that for p prime, f(x) = x?! + xf"? + + +- + x? + x + 1is irreducible 
in Q[x]. [Hint: (x — ) f(x) = £ — 1, so that f(x) = (P — 1)/(x — 1) and 
fx +1) = [(x + IP —1]/x. Expand (x + 1)? by the Binomial Theorem 


(Appendix E) and note that p divides k when k > 0. Use Eisenstein’s 
Criterion to show that f{x + 1) is irreducible; apply Exercise 12.] 


EXCURSION: Geometric Constructions (Chapter 15) may be covered at 


this point if desired. 


46 Irreducibility in R[x] and C[x]* 


Unlike the situation in Qf[x], it is possible to give an explicit description of all the irre- 
ducible polynomials in R[x] and C[x]. Consequently, you can immediately tell if a poly- 
nomial in R[x] or C[x] is irreducible without any elaborate tests or criteria. These facts 
are a consequence of the following theorem, which was first proved by Gauss in 1799: 


Theorem 4.26 The Fundamental Theorem of Algebra 


Every nonconstant polynomial in C[x] has a root in C. 


This theorem is sometimes expressed in other terminology by saying that the field 
C is algebraically closed. Every known proof of the theorem depends significantly on 
facts from analysis and/or the theory of functions of a complex variable. For this rea- 
son, we shall consider only some of the implications of the Fundamental Theorem on 
irreducibility in C[x] and R[x]. For a proof, see Hungerford [5]. 


*This section is used only in Chapters 11 and 12. It may be omitted until then, if desired. 
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Corollary 4.27 


A polynomial is irreducible in C[x] if and only if it has degree 1. 


Proof +a polynomial f(x) of degree = 2 in C[x] has a root in C by Theorem 4.26 
and hence a first-degree factor by the Factor Theorem. Therefore f(x) is 
reducible in C[x], and every irreducible polynomial in C[x] must have 
degree 1. Conversely, every first-degree polynomial is irreducible 
(Example 1 in Section 4.3). W 


Corollary 4.28 


Every nonconstant polynomial f(x) of degree n in C[x] can be written in the 
form c(x — a)(x — ao) «+ + (x — an) for some c, a4, a, ..., a, E C. This factor- 
ization is unique except for the order of the factors. 


Proof >By Theorem 4.14, f(x) is a product of irreducible polynomials in C[x]. 
Each of them has degree 1 by Corollary 4.27, and there are exactly n of 
them by Theorem 4.2. Therefore, 


F(X) = (rix + sira + s3) ++ > rx + Sp) 
= r(x — (Sr rae — (rr) +» + ye (r S) 
= x — a(x — a) +++ (x — a), 
where c = rr, +++ r, and a, = r,;'s, Uniqueness follows from Theorem 4.14; 


see Exercise 25 in Section 4.3. E 


To obtain a description of all the irreducible polynomials in R[x], we need 


Lemma 4,29 


If f(x) is a polynomial in R[x] and a + bj is a root of f(x) in C, then a — biż is also 
a root of f(x). 


Proof > If c =a + bi € C (with a, b € R), let c denote a — bi. Verify that for 
any c,d E C, 
(et+d=ct+d and cd=cd. 
Also note that ¢ = c if and only if c is a real number. Now, if f(x) = ax? + 
+ + a,x + a and cis a root of f(x), then f(e) = 0, so that 
wish saF Saat a 
= a,c" +--+ a6 + ay 
=a," +--+ a¢ +a, [Because each a,ER.] 
= f(e). 
Therefore ¢ = a — bi is also a root of f(x). E 
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Theorem 4,30 


A polynomial f(x) is irreducible in R[x] if and only if f(x) is a first-degree poly- 
nomial or 


fo =aet+bx+c with b? ~ 4ac <0. 


Proof> The proof that the two kinds of polynomials mentioned in the theo- 
rem are in fact irreducible is left to the reader (Exercise 7). Conversely, 
suppose f(x) has degree = 2 and is irreducible in R[x]. Then f(x) has a 
root win C by Theorem 4.26. Lemma 4.29 shows that w is also a root 
of f(x). Furthermore, w + w (otherwise w would be a real root of f(x), 
contradicting the irreducibility of /(x)). Consequently, by the Factor 
Theorem, x — w and x — w are factors of f(x) in C[-x]; that is, (x) = 
(x — w)(x — w)h(x) for some A(x) in Cix]. Let g(x) = (x — w)(x — w) 
then f(x) = g(h(x) in C[x]. Furthermore, if w = r + si (with r, s e R), 
then 


g(x) = (x — wx — w) = (x — (r + si) — (r — si) 
=x — 2x + (r+ P. 


Hence, the coefficients of g(x) are real numbers. 

We now show that A(x) also has real coefficients. The Division 
Algorithm in R[x] shows that there are polynomials q(x), r(x) in R[x] 
such that f(x) = g(x)q(x) + r(x), with r(x) = 0 or deg r(x) < deg g(x). In 
C[x], however, we have f(x) = g(x)h(x) + 0. Since g(x) and r(x) can be 
considered as polynomials in C[x], the uniqueness part of the Division 
Algorithm in C[x] shows that q(x) = A(x) and r(x) = 0. Thus A(x) = 
q(x) € R[x]. Since f(x) = g(x)A(x) and f(x) is irreducible in R[x] and 
deg g(x) = 2, A(x) must be a constant d € R. Consequently, f(x) = dg(x) 
is a quadratic polynomial in R[x] and hence has the form ax? + bx + c 
for some a, b, c E R. Since f(x) has no roots in R, the quadratic formula 
(Exercise 6) shows that b? — 4ac < 0. E 


Corollary 4.31 


Every polynomial f(x) of odd degree in R[x] has a root in R. 


Proof» By Theorem 4.14, f(x) = p,(x)p2(x) * * * P with each p(x) irreduc- 
ible in R[x]. Each p;(x) has degree 1 or 2 by Theorem 4.30. Theorem 4.2 
shows that 


deg f(x) = deg pi(x) + deg p(x) + - ++ + deg px). 


Since f(x) has odd degree, at least one of the p,(x) must have degree 1. 
Therefore, f(x) has a first-degree factor in R[x] and, hence, a root in R. E 
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It may seem that the Fundamental Theorem and its corollaries settle all the basic 
questions about polynomial equations. Unfortunately, things aren’t quite that simple. 
None of the known proofs of the Fundamental Theorem provides a constructive way 
to find the roots of a specific polynomial.* Therefore, even though we know that every 
polynomial equation has a solution in C, we may not be able to solve a particular 
equation. 

Polynomial equations of degree less than 5 are no problem. The quadratic formula 
shows that the solutions of any second-degree polynomial equation can be obtained 
from the coefficients of the polynomials by taking sums, differences, products, quotients, 
and square roots. There are analogous, but more complicated, formulas involving cube 
and fourth roots for third- and fourth-degree polynomial equations (see page 423 for one 
version of the cubic formula). However, there are no such formulas for finding the roots 
of all fifth-degree or higher-degree polynomials. This remarkable fact, which was proved 
nearly two centuries ago, is discussed in Section 12.3. 


E Exercises 


A. 1. Find all the roots in C of each polynomial (one root is already given): 

(a) x* — 3x? + x7 + 7x — 30; root 1 — 2i 
(b) x4 — 2x9 — xX + 6x — 6;root l + i 
(c) x4 — 4x7 + 3x7 + 14x + 26; root 3 + 2i 

2. Find a polynomial in R[x] that satisfies the given conditions: 
(a) Monic of degree 3 with 2 and 3 + ias roots 
(b) Monic of least possible degree with 1 — i and 2i as roots 
(€) Monic of least possible degree with 3 and 4i — 1 as roots 

3. Factor each polynomial as a product of irreducible polynomials in Q[x], in 
R[x], and in Cix]: 
(a) x -2 (b) +1 (c) x8 — 27 — 5x +5 

4. Factor xX? + x + 1+ iin C[x]. 


B. 5. Show that a polynomial of odd degree in R[x] with no multiple roots must 
have an odd number of real roots. 


*It may seem strange that it is possible to prove that a root exists without actually exhibiting one, 
but such “existence theorems” are quite common in mathematics. A very rough analogy is the 
situation that occurs when a person is killed by a sniper’s bullet. The police know that there is a 
killer, but actually finding the killer may be difficult or impossible. 
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6. Let f(x) = ax? + bx + ¢ © R[x] with a + 0. Prove that the roots of f(x) in C 
are 


-b + VE — 4ac -b — Vb" — 4ac 
—_—.——_ and ———.. 
2a 2a 
[Hint: Show that ax? + bx + c = 0 is equivalent to x? + (b/a)x = —c/a; then 
complete the square to find x.] 


7. Prove that every ax” + bx + c e R[x] with b — 4ac < 0 is irreducible in R[x]. 
[Hint: See Exercise 6]. 


8. If a + biisa root of x? — 3x? + 2ix + i — 1 © Cx], then is it true that a — bi 
is also a root? 
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CHAPTER 4 


Congruence in F[x] and Congruence-Class Arithmetic 


In this chapter we continue to explore the analogy between the ring Z of integers 
and the ring F[x] of polynomials with coefficients in a field F. We shall see that the 
concepts of congruence and congruence-class arithmetic carry over from Z to 
F[x] with practically no changes. Because of the additional features of the polyno- 
mial ring F[x] (polynomial functions and roots), these new congruence-class rings 
have a much richer structure than do the rings Z,. This additional structure leads 
toa striking result: Given any polynomial over any field, we can find a root of that 
polynomial in some larger field. 


[en Congruence in F[x] and Congruence Classes 


The concept of congruence of integers depends only on some basic facts about divisibility 
in Z. If Fis a field, then the polynomial ring F[x] has essentially the same divisibility 
properties as does Z. So it is not surprising that the concept of congruence in Z and its 
basic properties (Section 2.1) can be carried over to F[x] almost verbatim. 


Definition Let F be a field and f(x), g(x), p(x) e Ax] with p(x) nonzero. Then f(x) is 
congruent to g(x) modulo p{x}—written f(x) = g(x) (mod p(x)}provided 
that p(x) divides f(x) — g(x). 


EXAMPLE 1 


In Q(x], x7 + x + 1 = x + 2 (mod x + 1) because 
(x? +x +1) —(x +2) = 7-1 = (xt Ix - 1). 


125 
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Definition 


EXAMPLE 2 


In R[x], 3x4 + 4x? + 2x +2 =x + 3x7 + 3x + 4 (mod x? + 1) because 
division shows that 
(3x4 + 4x? + 2x + 2) — (09 + 3x? + 3x + 4) = 3x4 ae +? -—x -2 
= (x? + 1)(3x — x — 2). 


Theorem 5.1 
Let F be a field and p(x) a nonzero polynomial in [x]. Then the relation of 
congruence modulo p(x) is 
(1) reflexive: f(x) = f(x) (mod p(x)) for all f(x) €e Fix]; 
(2) symmetric: if f(x) = g(x) (mod p(x)), then g(x) = f(x) (mod p(x)); 
(8) transitive: if f(x) = g(x) (mod p(x)) and g(x) = A(x) (mod p(x)), then 
fix) = A(x) (mod p(x). 


Proof» Adapt the proof of Theorem 2.1 with p(x), f(x), g(x), A(x) in place of 
a, a,b,c. M 


Theorem 5.2 


Let F be a field and p(x) a nonzero polynomial in F[x]. tf f(x) = g(x) (mod p(x)) 
and A(x) = k(x) (mod p(x)), then 

(1) Ax) + A(x) = g(x) + k(x) (mod p(x), 

(2) fod O) = g(x)k(x) (mod p{x)). 


Proof» Adapt the proof of Theorem 2.2 with p(x), f(x), gx), A(x), k(x) in place 
of n, a,b,c,d. B 


Let F bea field and f(x), p(x} e F[x] with p(x) nonzero. The congruence class 
(or residue class) of f(x) modulo p(x) is denoted [f(x)] and consists of all 
polynomials in Ax] that are congruent to f(x) modulo p(x), that is, 


[f(x)] = {g(x) | g(x) € Ax] and g(x) = f(x) (mod p(x))}. 


Since g(x) = f (x) (mod p(x)) means that g(x) — f(x) = k(x)p(x) for some k(x) € Fx] 


or, equivalently, that g(x) = f(x) + k(x)p(x), we see that 


ON = ie le) = f) (mod p(x))} 
= f(x) + kop) | k(x) E FE}. 
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EXAMPLE 3 


Consider congruence modulo x? + 1 in R[x]. The congruence class of 2x + 1 is 
the set 


{(2x + 1) + k(x)? + 1) [k(x) e REX}. 


The Division Algorithm shows that the elements of this set are the polynomials in R[x] 
that leave remainder 2x + 1 when divided by x7 + 1. 


EXAMPLE 4 


Consider congruence modulo x? + x + 1 in Z,[x]. To find the congruence 
class of x?, we note that x? = x + 1 (mod x? + x + 1) because x? — (x + 1) = 
x? — x — 1 = (x? + x + 1)1 (remember that 1 + 1 = 0 in Z}, so that 1 = —1). 
Therefore, x + 1 is a member of the congruence class [x?]. In fact, the next 
theorem shows that [x + 1] = [x’]. 


Theorem 5.3 
f(x) = g(x) (mod p(x) if and only if [7(x)] = [g(x)]. 


Proof» Adapt the proof of Theorem 2.3 with f(x), g(x), p(x), and Theorem 5.1 
in place of a, c, n, and Theorem 2.1. WE 


Corollary 5.4 


Two congruence classes modulo p(x) are either disjoint or identical. 


Proof» Adapt the proof of Corollary 2.4. E 


Under congruence modulo x in Z, there are exactly z distinct congruence classes 
(Corollary 2.5). These classes are [0], [1], . .., [” — 1]. Note that there is a class for each 
possible remainder under division by n. In F[x] the possible remainders under divi- 
sion by a polynomial of degree n are all the polynomials of degree less than z (and, of 
course, 0). So the analogue of Corollary 2.5 is 


Corollary 5.5 


Let F be a field and p(x) a polynomial of degree n in F[x], and consider congru- 
ence modulo p(x). 
(1) Hf f(x) e Fix] and r(x) is the remainder when f(x) is divided by p(x), then 
o) = iro]. 
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(2) Let S be the set consisting of the zero polynomial and all the poly- 
nomials of degree less than n in Ax]. Then every congruence class 
modulo p(x) is the class of some polynomial in S, and the congru- 
ence classes of different polynomials in $ are distinct. 


Proof» 1) By the Division Algorithm, f(x) = p(x)q(x) + r(x), with r(x) = Opor 
deg r(x) < n. Thus, f(x) — (x) = p(x)q(2), so that f(x) = r(x) (mod p(x)). 
By Theorem 5.3, [ f(x)] = P9]. 

(2) Since r(x) = 07 or deg r(x) < n, we see that r(x)e S. Hence, every 

congruence class is equal to the congruence class of a polynomial in S. 
Two different polynomials in S cannot be congruent modulo p(x) because 
their difference has degree less than z, and hence is not divisible by p(x). 
Therefore, different polynomials in S must be in distinct congruence 
classes by Theorem 5.3. @ 


The set of all congruence classes modulo p(x) is denoted 


F[xI/(p@), 


which is the notational analogue of Z,. 


EXAMPLE 5 


Consider congruence modulo x? + 1 in R[x]. There is a congruence class for 
each possible remainder on division by x? + 1. Now, the possible remainders 
are polynomials of the form rx + s (with r, s € R; one or both of r, s may 
possibly be 0). Therefore, R[x]/(x? + 1) consists of infinitely many distinct 
congruence classes, including 


[ol ix] [x + 11, [x + 3] E + 2} — a 


Corollary 5.5 states that [rx + s] = [ex + d] if and only if rx + s is equal (not 
just congruent) to ex + d. By the definition of polynomial equality, rx + s = 
ex + dif and only if r = cand s = d. Therefore, every element of R[xJ/(x? + 1) 
can be written wriquely in the form [rx + s]. 


EXAMPLE 6 


Consider congruence modulo x? + x + 1 in Z,[x]. The possible remainders on 
division by x? + x + 1 are the polynomials of the form ax + b with a, bE Z} 
Thus there are only four possible remainders: 0, 1, x, and x + 1. Therefore, 
Z2[x]/(? + x + 1) consists of four congruence classes: [0], [1], [x], and [x + 1). 


EXAMPLE 7 


The pattern in Example 6 works in the general case. Let n be a prime integer, 
so that Z, is a field and the Division Algorithm holds in Z,[x]. If p(x) € Z,[x] 
has degree k, then the possible remainders on division by p(x) are of the form 
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QB + ayxtee s+ appt, with aE Z,„. There are n possibilities for each of 
the k coefficients a, . . . ,a@,_,,and so there are #* different polynomials of this 
form. Consequently, by Corollary 5.5, there are exactly #* distinct congruence 
classes modulo p(x) in Z,[x]/(p(x)). 


E Exercises 


NOTE: F denotes a field and p(x) a nonzero polynomial in F[x]. 
A. 1. Let f(x), g(x), p(x) € F[x], with p(x) nonzero. Determine whether f(x) = g(x) 
(mod p(x)). Show your work. 
(a) f(x) = 2x0 — 2x4 + 423 + x + 1; g(x) = 3x4 + 22 — 5x2 — 9; 
P(x)=H?+1;F=Q 
b) f(x) = x4 +22 4+ x4 l; g(x) = x4 4+ P4741; 
P(x) =?+x;F=Z, 
(c) f(x) = 3x8 + 4x4 + 5x3 — 627 + 5x — 7; 
g(x) = 2x + 6x4 + x9 + 2x7 + 2x — 5; p(x) = 8-2 +x-1;F=R 
2. If p(x) is a nonzero constant polynomial in F[x], show that any two 
polynomials in F[x] are congruent modulo p(x). 


3. How many distinct congruence classes are there modulo x° + x + 1 in Zad]? 
List them. 


4, Show that, under congruence modulo x? + 2x + 1 in Z,[x], there are exactly 
27 distinct congruence classes. 
5. Show that there are infinitely many distinct congruence classes modulo x’ — 2 
in Q[x]. Describe them. 
6. Let æE F. Describe the congruence classes in F [x] modulo the polynomial x — a. 
7. Describe the congruence classes in F[x] modulo the polynomial x. 
8. Prove or disprove: If p(x) is relatively prime to k(x) and f(x)k(x) = g(x)k(x) 
(mod p(x)), then f(x) = g(x) (mod p(x). 
9. Prove that f(x) = g(x) (mod p(x)) if and only if x) and g(x) leave the same 
remainder when divided by p(x). 
10. Prove or disprove: If p(x) is irreducible in F[x] and f(x)g(x) = Or (mod p(x)), 
then fx) = 0; (mod p(x)) or g(x) = 0; (mod p(x)). 
11. If p(x) is reducible in F[x], prove that there exist f(x), g(x) € F[x] such that 
f=) # Or (mod p(x)) and g(x) # Or (mod p(x)) but f(x)g(x) = Or (mod p(x). 
12. If f(x) is relatively prime to p(x), prove that there is a polynomial g(x) € F[x] 
such that f(x)g(x) = 1p (mod p(x)). 


13. Suppose f(x), g(x) e R[x] and f(x) = g(x) (mod x). What can be said about the 
graphs of y = f(x) and y = g(x)? 
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52 Congruence-Class Arithmetic 


Congruence in the integers led to the rings Z,„. Similarly, congruence in F[x] also pro- 
duces new rings and fields. These turn out to be much richer in structure than the rings 
Z@,,. The development here closely parallels Section 2.2. 


Theorem 5.6 


Let F be a field and p(x) a nonconstant polynomial in FLX]. If [f(x)] = [g(x)] and 
[A(x)] = [k(x] in FLx]/(p(00), then, 


(Ax) + AQ] = fo) +k] and [foh] = goko]. 
Proof» Copy the proof of Theorem 2.6, with Theorems 5.2 and 5.3 in place of 
Theorems 2.2 and 2.3. Em 


Because of Theorem 5.6 we can now define addition and multiplication of con- 
gruence classes just as we did in the integers and be certain that these operations are 
independent of the choice of representatives in each congruence class. 


Definition Let F be a field and p(x) a nonconstant polynomial in F[x]. Addition and 


multiplication in F[x]/(p(x)) are defined by 
[AX] + Lg) = [f0 + gò), 
Ng] = fg). 


EXAMPLE 1 


Consider congruence modulo xX? + 1 in R[x]. The sum of the classes [2x + 1] 
and [3x + 5] is the class 


[(2x + 1) + (3x + 5)] = [5x + 6]. 
The product is 
[2x + 1][3x + 5] = [(2x + 1(3x + 5)] = [6x7 + 13x + 5]. 


As noted in Example 5 of Section 5.1, every congruence class in R[x]/(x? + 1) 
can be written in the form [ax + b]. To express the class [6x7 + 13x + 5] in this 
form, we divide 6x? + 13x + 5 by x? + 1 and find that 

6x7 + 13x + 5 = 6(x7 + 1) + (13x — 1). 


It follows that 6x7 + 13x + 5 =13x— 1 (mod 2 + 1), and hence [6x7 + 13x + 5] = 
[13x — 1]. 
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EXAMPLE 2 


In Example 6 of Section 5.1, we saw that Zfx]/(x? + x + 1) consists of four 
classes: [0], [1], [x], and [x + 1]. Using the definition of addition of classes, 
we see that [x + 1] + [1] = [x + 1 + 1] = [x] (remember that 1 + 1 = 0 

in 2,). Similar calculations produce the following addition table for 

Zx\/ (x? + x + 1): 


+ [0] [1] ' [x +1] 
[0] [0] [1] [x] [x + 1] 
W ua CU DN a e a 3 E E 
[x] [x] [Ix+1] O [1] 
+1) | +) [J [1] [0] 
Most of the multiplication table for Z,[x]/(x? + x + 1) is easily obtained from 
the definition: 
| o t (bd +l 
[0] [o) [9] O A 
N i A i ASO: SONS ib] wN 
[x] 0O 


[x + 1] [0] [x + 1] 


To fill in the rest of the table, note, for example, that 
[x] > [x + 1] = P(x + 1) = [2 + x]. 


Now division or simple addition in Z,[x] shows that x? + x = (2 +x+1)4+1. 
Therefore, x7 + x = 1 (mod x? + x + 1), so that [x? + x] = [1]. A similar calcu- 
lation shows that [x] + [x] = [xf] = [x + 1] (because x? = (x? + x + 1) + (x + 1) 
in Z,[x]). Verify that [x + 1][x + 1] = [x]. 


If you examine the tables in the preceding example, you will see that 
Zx] + x + 1) is a commutative ring with identity (in fact, a field). In view 
of our experience with Z and Z,, this is not too surprising. What is unexpected is the 
upper left-hand corners of the two tables (the sums and products of [0] and [1]). It is 
easy to see that the subset F* = {[0],[1]} is actually a subring of Z,[x]/(x? + x + 1) 
and that F* is isomorphic to Z, (the tables for the two systems are identical except for 
the brackets in F*). These facts illustrate the next theorem. 


Theorem 5.7 


Let F be a field and p(x) a nonconstant polynomial in A[x]. Then the set 
F[x]/(p(x)) of congruence classes modulo p(x) is a commutative ring with 
identity. Furthermore, F[x]/(p(x)) contains a subring F* that is isomorphic to F. 
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Proof» To prove that F[x]/(p(x)) is a commutative ring with identity, adapt the 
proof of Theorem 2.7 to the present case. Let F* be the subset of 
F[x]/(p(x)) consisting of the congruence classes of all the constant 
polynomials; that is, F* = {[a]|ae F}. Verify that F* is a subring of 
F[x]/((x)) (Exercise 10). Define a map g:F — F* by g(a) = [a]. This 
definition shows that ¢ is surjective. The definitions of addition and 
multiplication in F[x]/(p(2x)) show that 


gla + b) = [a + b] = [a] + [b] = p(a) + ob) and 
(ab) = [ab] = [a] - [6] = g(a) ` pb). 
Therefore, g is a homomorphism. 
To see that @ is injective, suppose g(a) = ¢(b). Then [a] = [b], so that 
a= b (mod p(x)). Hence, p(x) divides a — b. However, p(x) has degree = 1, 


and a — bE F. This is impossible unless a — b = 0. Therefore, a = b and 
¢ is injective. Thus g:F— F* isan isomorphism. @ 


We began with a field F and a polynomial p(x) in F[x]. We have now constructed a 
ring F[x]/(p(x)) that contains an isomorphic copy of F. What we would really like is a 
ring that contains the field F itself. There are two possible ways to accomplish this, as 
illustrated in the following example. 


EXAMPLE 3 


In Example 2, we used the polynomial x? + x + 1 in Z,[x] to construct the ring 
Zx]/(x? + x + 1), which contains a subset F* = {[0], [1]} that is isomorphic to 
Z,. Suppose we identify Z, with its isomorphic copy F* inside Z[x]/(x? + x + 1) 
and write the elements of F* as if they were in Z}. Then the tables in Example 2 
become 


x 0 1 [x] [x + 1] 
0 0 0 0 0 

1 0 1 [x] [x + 1] 
[x] 0 [x] [eka] 1 

[x + 1] 0 [x+1] 1 [x] 


We now have a ring that has Z, as a subset. If this procedure makes you a bit 
uneasy (is Z, really a subset?), you can use the following alternate route to the 
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same end. Let E be any four-clement set that actually contains Z, as a subset, 
say E = {0, 1, r,s}. Define addition and multiplication in E by 


+0 Il py a O L y h 
00 1 r g 0;0 0 0 0 
riL 0O g p ta T wn g 
Fil, T poaa PIO & «& A 
olg aw L 0 ciO w t g 


A comparison of the tables for Z,[x]/(x7 + x + 1) and those for E shows that 
these two rings are isomorphic (replacing [x] by r and [x + 1] by s changes 
one set of tables into the other). Therefore, E is essentially the same ring we 
obtained before. However, E does contain Z, as an honest-to-goodness subset, 
without any identification. 


What was done in the preceding example can be done in the general case. Given 
a field F and a polynomial p(x) in F[x], we can construct a ring that contains F as 
a subset. The customary way to do this is to identify F withits isomorphic copy F* 
inside F[x]/(p(x)) and to consider F to be a subset of F[x]/(p(x)). If doing this 
makes you uncomfortable, keep in mind that you can always build a ring isomorphic 
to F[x]/(p(x)) that genuinely contains F as a subset, as in the preceding example. 
Because this latter approach tends to get cumbersome, we shall follow the usual 
custom and identify F with F* hereafter. Consequently, when a, b € F, we shall write 
b[x] instead of [b][x] and a + b[x]instead of [a] + [5][x] = [a + bx]. Then Theorem 5.7 
can be reworded: 


Theorem 5.8 


Let F be a field and p(x) a nonconstant polynomial in F[x]. Then ALx]/(p(x)) is a 
commutative ring with identity that contains F. 


If a and n are integers such that (a, n) = 1, then by Theorem 2.10, [a] is a unit in Z,. 
Here is the analogue for polynomials. E 


Theorem 5,9 


Let F be a field and p(x) a nonconstant polynomial in FLx]. lf f(x) € F [x] and f(x) 
is relatively prime to p(x), then [f(x)] is a unit in F[x]/(p(x)). 


Proof » By Theorem 4.8 there are polynomials u(x) and v(x) such that f(x)u(x) + 
P(x)e(x) = 1. Hence, f(x)u(x) — 1 = —plx)x) = p(x)(—v(x)), which 
implies that [/(x)«(x)] = [1] by Theorem 5.3. Therefore, [/(x)][«(x)] = 
Lf(x)u(x)] = [1], so that [Ax] is a unit in F[x]/(p(x)). m 
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EXAMPLE 4 


Since x — 2 is irreducible in Q[x], 2x + 5 and X? ~ 2 are relatively prime in Q[x]. 
(Why?) Hence, [2x + 5]isa unit in the ring O[x]/(x? — 2). The proof of Theorem 5.9 
shows that its inverse is [u(x)], where (2x + 5)(x) + QÊ — JX = 1. Using the 
Euclidean Algorithm as in Exercise 15 of Section 1.2, we find that 


(2x + s(x 3) + (x7 - d£) = 


Therefore, [- = EF i ae is the inverse of [2x + 5] in Q[x]/(2? — 2). 


E Exercises 


A. In Exercises 1—4, write out the addition and multiplication tables for the congruence- 
class ring F[x]/(p(x)). In each case, is F[x]/(p(x)) a field? 
lL. F=Z; p(x) =x? +x4+1 2. F=2Z,3A0=x +1 
3. F=Zy3 p(x) =x? +1 4. F= Zp) =x +1 
B. In Exercises 5-8, each element of the given congruence-class ring can be written 
in the form [ax + b] (Why?). Determine the rules for addition and multiplication 


of congruence classes. (In other words, if the product [ax + bl[cex + d] is the 
class[rx + s], describe how to find r and s from a, b, c, d, and similarly for 


addition.) 
5. Rix]/(x? + 1) [Hint: See Example 1.] 
6. Q[x]/(x? — 2) 7. Q — 3) 8. O[x]/(x”) 


9. Show that R[x]/(x? + 1) is a field by verifying that every nonzero congruence 
class [ax + 5] is a unit. [Hint: Show that the inverse of [ax + b]is [cx + d], 
where c = —a/(a® + b?) and d = b/(@ + b°) 

10. Let F be a field and p(x) E F[x]. Prove that F* = {[a]| a E F} isa subring of 
F[x]/(P(x)). 


11. Show that the ring in Exercise 8 is not a field. 


12. Write out a complete proof of Theorem 5.6 (that is, carry over to F[x] the 
proof of the analogous facts for Z). 

13. Prove the first statement of Theorem 5.7. 

14. In each part explain why [ f{x)] is a unit in F[x]/(p(x)) and find its inverse. 
[Hint: To find the inverse, let u(x) and v(x) be as in the proof of Theorem 5.9. 
You may assume that u(x) = ax + b and u(x) = cx + d. Expanding f(x)u(x) + 
P(x)u(x) leads to a system of linear equations in a, b, c, d. Solve it.] 

(a) [f(x)] = 2x ~ 31E Q] — 2) 
(b) [fO] = p? + x + JEZ + 1) 


— 
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C.15. Find a fourth-degree polynomial in Z,[x] whose roots are the four elements of 
the field Z,[x]/ (x? + x + 1), whose tables are given in Example 3. [Hint: The 
Factor Theorem may be helpful.] 


16. Show that Q[x]/(x? — 2) is a field. 


| 5.3 | The Structure of F[x]/(p(x)) When p(x) Is Irreducible 


When p is a prime integer, then Theorem 2.8 states, in effect, that Z, is a field (and, of 
course, an integral domain). Here is the analogous result for F[x] and an irreducible 
polynomial p(x). 


Theorem 5.10 


Let F be a field and p(x) a nonconstant polynomial in F[x]. Then the following 
statements are equivalent: 


(1) p(x) is irreducible in Fx]. 
(2) F[x]/(p(x)) is a field. 
(3) Fx] /(p(x)) is an integral domain. 


Theorem 5.10 and most of its proof are a copy of Theorem 2.8 and its proof, with 
Z replaced by F[x] and Z, by F{x)/(p(x)), and the necessary adjustments made for the 
differences between prime integers and irreducible polynomials. 


Proof of Theorem 5.10 » (1) = (2) By Theorem 5.7, F(x)/(p(x)) is a commutative 
ring with identity, and thus satisfies Axioms 1-10. To prove that 
F(x)/(p(x)) is a field, we must verify that every nonzero element in 
F(x)/(p(x)) is a unit (Axiom 12, page 49). Suppose that [a(x)] # [0] in 
F(x)/(p(x)). We must find [2x)] such that [a(x)] [u(x)] = [17]. Since 
[a(x)] # [0], we know that a(x) # 0 (mod p(x)) by Theorem 5.3. Hence, 
P(x) x a(x) by the definition of congruence. Now the ged of a(x) and 
p(x) is a monic polynomial that divides both a(x) and p(x). Since p(x) 
is irreducible, the gcd is either 1, or a monic associate of p(x) (the only 
monic divisors of p(x)). As explained on page 100, an associate of p(x) 
is a polynomial of the form cp(x), with 0p # c EF. Consequently, a(x) 
is not divisible by any associate of p(x) (because a(x) is not divisible by 
P(x)). Since the ged also divides a(x) and p(x) ¥ a(x), the ged of a(x) and 
P(x) must be 1p. By Theorem 4.8, there are polynomials u(x) and v(x) so 
that a(x)u(x) + p(x)u(x) = 15. Hence, a(x)u(x) — 1p = p(x)(—v(x)), so 
that a(x)x) = 1p (mod p(x)). Therefore, [a(x)u(x)] = [1A in F(x)/(P(x)) 
by Theorem 5.3. Thus, [a(x)][u(x)] = [a(x)u(x)] = [ld, so that [a(x)] is a 
unit. Hence, F(x)/(p(x)) satisfies Axiom 12 and F(x)/(p(x)) is a field. 


(2) => (3) This is an immediate consequence of Theorem 3.8. 
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(3) => (1) We shall verify statement (2) of Theorem 4.12 to show that 
P(x) is irreducible. Suppose that b(x) and ¢(x) are any polynomials in F[x] 
and p(x) | b(x)e(x). Then 5(x)c(x) = Or (mod p(x)). So by Theorem 5.3, 


BONECA = Bee) = [Or] in Foe). 


Because F(x)/(p(x)) is an integral domain by (3), we have [a(x)] = [0,] 
or [b(x)] = [0z]. Thus, (x) = 0; (mod p(x)) or e(x) = 0r (mod p(x)) by 
Theorem 5.3, which means that p(x) | b(x) or p(x) | c(x) by the definition 
of congruence. Therefore, p(x) is irreducible by Theorem 4.12. W 


Theorem 5.10 can be used to construct finite fields. If p is prime and fx) is irreduc- 
ible in Z,[x] of degree k, then Z,[x]/(/(x)) is a field by Theorem 5.10. Example 7 in 
Section 5.1 shows that this field has p* elements. Finite fields are discussed further in 
Section 11.6, where it is shown that there are irreducible polynomials of every positive 
degree in Z, [x] and, hence, finite fields of all possible prime power orders. See Exercise 9 
for an example. 

Let F be a field and p(x) an irreducible polynomial in F[x]. Let K denote the field of 
congruence classes F[x]/(p(x)). By Theorems 5.8 and 5.10, F is a subfield of the field 
K. One also says that K is an extension field of F. Polynomials in F[x] can be consid- 
ered to have coefficients in the larger field K, and we can ask about the roots of such 
polynomials in K. In particular, what can be said about the roots of the polynomial 
P(x) that we started with? Even though p(x) is irreducible in F[x], it may have roots in 
the extension field K. 


EXAMPLE 1 


The polynomial p(x) = x? + x + 1 has no roots in Z, and is, therefore, irreducible 
in Z,[x] by Corollary 4.19. Consequently, K = Z,[x]/(x? + x + 1) is an extension 
field of Z, by Theorem 5.10. Using the tables for K in Example 3 of Section 5.2, 
we see that 


xP + e +1=[x+ 1 +f] 4+1=14+1=0. 


This result may be a little easier to absorb if we use a different notation. Let 

a = [x]. Then the calculation above says that a? + æ + 1 = 0; that is, a is a root 
in Kof p(x) = x? + x + 1. It’s important to note here that you don’t really 
need the tables for K to prove that a is a root of p(x) because we know that 

xX + x + 1 =0 (mod x + x + 1). Consequently, [x? + x + 1] = 0 in K, and 

by the definition of congruence-class arithmetic, 


a +a +1 = fx + [x+ l =p ?+x4+ 1] =0. 


For the general case we have 


Theorem 5.11 


Let F be a field and p(x) an irreducible polynomial in F[x]. Then F[x]/(p(x)) is an 
extension field of F that contains a root of p(x). 
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Proof» Let K = F []/(p(x)). Then K is an extension field of F by Theorems 5.8 
and 5.10. Let p(x) = a,x" + +++ + a,x + a, where each a, is in Fand, 
hence, in K. Let a = [x] in K. We shall show that a is a root of p(x). By 
the definition of congruence-class arithmetic in K, 


aa” +++ + aya t+ ay = a,[x]" ++ +++ af] + a 
= [a,x" + +++ + a,x + ay] 
= [p@)] = Or [Because p(x) = Op (mod p(x)).} 


Therefore, a € Kis a root of p(x). E 


Corollary 5.12 


Let F be a field and f(x) a nonconstant polynomial in F[x]. Then there is an 
extension field K of F that contains a root of f(x). 


Proof» By Theorem 4.14, f(x) has an irreducible factor p(x) in F [x]. By Theorem 
5.11, K = F[x]/(p(x)) is an extension field of F that contains a root of p(x). 
Since every root of p(x) is a root of f(x), K contains a root of f(x). m 


The implications of Theorem 5.11 run much deeper than might first appear. 
Throughout the history of mathematics, the passage from a known number system to a 
new, larger system has often been greeted with doubt and distrust. In the Middle Ages, 
some mathematicians refused to acknowledge the existence of negative numbers. When 
complex numbers were introduced in the seventeenth century, there was uneasiness—- 
which extended for nearly a century—because some mathematicians would not accept 
the idea that there could be a number whose square is —1, that is, a root of x? + 1. One 
cause for these difficulties was the lack of a suitable framework in which to view the 
situation, Abstract algebra provides such a framework. Theorem 5.11 and its corollary, 
then, take care of the doubt and uncertainty. 

It is instructive to consider the complex numbers from this point of view. Instead 
of asking about a number whose square is —1, we ask, “Is there a field containing 
R in which the polynomial x? + 1 has a root?” Since x? + 1 is irreducible in R[x], 
Theorem 5.11 tells us that the answer is yes: K = R[x]/(x? + 1) is an extension field of 
R that contains a root of x? + 1, namely a = [x]. In the field K, œ is an element whose 
square is —1. But how is the field K related to the field of complex numbers introduced 
earlier in the book? 

As is noted in Example 5 of Section 5.1, every element of K = R[x]/(x? + 1) can 
be written uniquely in the form [ax + b] with a, b ER. Since we are identifying each 
element r E R with the element [r] in K, we see that every element of K can be written 
uniquely in the form 


[a + dx] = [a] + [5][x] = a + ba. 
Addition in K is given by the rule 


(a + ba) + (c + da) = [a + bx] + [c + dx] = [(a + bx) + (c + dx)] 
= [(a + c) + (b + dx] = [a + c] + [b + di]. 
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so that 
(a + ba) + (c + da) = (a + c) + (b + d)a. 
Multiplication in K is given by the rule 


(a + ba)(c + da) = [a + bx][c + dx] = [(a + bx)(e + dx)] 
= [ac + (ad + be)x + bdx?| 
= ac + (ad + be)a + bda’. 


However, a is a root of x? + 1, and so a? = —1. Therefore, the rule for multiplication 
in K becomes 


(a + ba)(c + da) = (ac — bd) + (ad + be)a. 


If the symbol « is replaced by the symbol i, then these rules become the usual rules for 
adding and multiplying complex numbers. In formal language, the field K is isomor- 
phic to the field C, with the isomorphism f being given by fa + ba) = a + bi. 

Up to now we have taken the position that the field C of complex numbers was 
already known. The field K constructed above then turns out to be isomorphic to the 
known field C. A good case can be made, however, for not assuming any previous 
knowledge of the complex numbers and using the preceding example as a definition 
instead. In other words, we can define € to be the field R [x]/(x? + 1). Such a definition 
is obviously too sophisticated to use on high-school students, but for mature students 
it has the definite advantage of removing any lingering doubts about the validity of 
the complex numbers and their arithmetic.* Had this definition been available several 
centuries ago, the introduction of the complex numbers might have caused no stir 
whatsoever. 


E Exercises 


NOTE: F always denotes a field. 


A. 1. Determine whether the given congruence-class ring is a field. Justify your 
answer. 


(a) Z3[x]/(8 + 2x? + x + 1) 
(b) Zs[x]/(2x? — 4x? + 2x + 1) 
© Zx + x7 + 1) 
B. 2. (a) Verify that Q(V2) = {r + sV2 |r, s€ Q} is a subfield of R. 


(b) Show that Q(-V2) is isomorphic to Q[x]/(x? — 2). [Hint: Exercise 6 in 
Section 5.2 may be helpful.] 


*Only a minor rearrangement of this book is needed to accommodate such a definition. A few 
examples in Chapter 3 would have to be omitted, and the discussion of irreducibility in C[x] 
and R[x] (Section 4.6) would have to be postponed. All the intervening material in Chapter 5 is 
independent of any formal knowledge of the complex numbers. 
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3. If ac F, describe the field F [x]/(x — a). 


4. Let p(x) be irreducible in F [x]. Without using Theorem 5.10, prove that if 
(fle) = [0A in F[x]/(p(>)), then [f(x] = [Og] or [g()] = [0,). [Hint: 
Exercise 10 in Section 5.1.] 

5. (a) Verify that Q(-V3) = {r + sV3 |r, se Q} is a subfield of R. 

(b) Show that Q(-V3) is isomorphic to Q[x]/(x* — 3). 


6. Let p(x) be irreducible in F [x]. If [f (x) + [04 in F[x]/(p(x)) and A(x) € 
F [x], prove that there exists g(x) € F [x] such that [ /(x)][g(x)] = [A(x] in 
F[x]/(p(x)). [Hint: Theorem 5.10 and Exercise 12(b) in Section 3.2.] 


7. If f(x) € F[x] has degree n, prove that there exists an extension field £ of 
F such that f (x) = e9(x — ¢1)(x — c) * ++ (x — c) for some (not necessarily 
distinct) c € E. In other words, E contains all the roots of f(x). 


8. If p(x) is an irreducible quadratic polynomial in F [x], show that F [x]/(p(x)) 
contains all the roots of p(x). 
9. (a) Show that Z,[x]/(x? + x + 1) isa field. 
(b) Show that the field Z,[x]/(x? + x + 1) contains all three roots of x? + x + 1. 
10. Show that Q[x]/(x? — 2) is not isomorphic to Q[x]/(2? — 3). [Hint: Exercises 2 
and 5 may be helpful.] 


11. Let K be a ring that contains Z, as a subring. Show that p(x) = 3x7 + 1 € Zex] has 
no roots in K. Thus, Corollary 5.12 may be false if F is not a field. [Hint: If u 
were a root, then 0 = 2+ 3 and 34? + 1 = 0. Derive a contradiction.] 

12. Show that 2x? + 4x7 + 8x + 3 € Z,e[x] has no roots in any ring K that contains 
Zi as asubring. [See Exercise 11.] 


C. 13. Show that every polynomial of degree 1, 2, or 4 in Z.[x] has a root in 
Zx + x + 1). 
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CHAPTER § 


ideals and Quotient Rings 


Congruence in the integers led us to the finite arithmetics Z, and helped moti- 
vate the definition of a ring. Congruence in the polynomial ring F[x] resulted in a 
new class of rings consisting of the various F[x]/(p(x)). These rings enabled us to 
construct extension fields of F that contained roots of the polynomial p(x). In this 
chapter the concept of congruence is extended to arbitrary rings, producing 
additional rings and a deeper understanding of algebraic structure. 

You will see that much of the discussion is an exact parallel of the development 
of congruence in Z (Chapter 2) and in F{x] (Chapter 5). Nevertheless, the results 
here are considerably broader than the earlier ones. 


| 6.1 | Ideals and Congruence 


Our goal is to develop a notion of congruence in arbitrary rings that includes as spe- 
cial cases congruence modulo z in Z and congruence modulo p(x) in F[x]. We begin by 
taking a second look at some examples of congruence in Z and F[x] from a somewhat 
different viewpoint than before. 


EXAMPLE 1 
In the ring Z, a = b (mod 3) means that a — b is a multiple of 3. Let J be the set 
of all multiples of 3, so that 
I= {0, +3, +6,...}. 
Then congruence modulo 3 may be characterized like this: 


a = b (mod 3) means a— bel. 
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Observe that the subset J is actually a subring of Z (sums and products of mul- 
tiples of 3 are also multiples of 3). Furthermore, the product of any integer and 
a multiple of 3 is itself a multiple of 3. Thus the subring 7 has this property: 


Whenever k eZ and ic J, then ki€/. 


EXAMPLE 2* 


The notation f(x) = g(x) (mod x? — 2) in the polynomial ring Q[x] means that 
fœ) — g(x) is a multiple of x? — 2. Let Ibe the set of all multiples of x? — 2 in Qf, 
that is, Z = {A(x\(x? — 2)| A(x) E Q[x]}. Once again, it is not difficult to check that T is 
asubnng of Q[x] with this property: 


Whenever A(x) € Q[x] and “(x)eE I, then k(x)i(x) ET 


(the product of any polynomial with a multiple of x? — 2 is itself a multiple of x* — 2). 
Congruence modulo x? — 2 may be described in terms of I: 


f(x) = g(x) (mod x — 2) means f{x)— g(x)eEl. 


These examples suggest that congruence in a ring R might be defined in terms 
of certain subrings. If J were such a subring, we might define a = b (mod /) to 
mean a — bel. The subring 7 might consist of all multiples of a fixed element, as in 
the preceding examples, but there is no reason for restricting to this situation. The 
examples indicate that the key property for such a subring Z is that it “absorbs prod- 
ucts”: Whenever you multiply an element of J by any element of the ring (either inside 
or outside J), the resulting product is an element of J. The set of all multiples of a fixed 
element has this absorption property. We shall see that many other subrings have it as 
well. Because such subrings play a crucial role in what follows, we pause to give them 
a name and to consider their basic properties. 


Definition A subring / of a ring A is an ideal provided: 


Whenever re Rand ae/, then rac/andarel. 


The double absorption condition that rae J and ar EF is necessary for noncommutative rings. 
When R is commutative, as in the preceding examples, this condition reduces to ra E I. 


EXAMPLE 3 


The zero ideal in a ring R consists of the single element Og. This is a subring that absorbs all 
products since r0z = Op = Ogr for every r E R. The entire ring R is also an ideal. 


*Skip this example if you have not read Chapter 5. 
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EXAMPLE 4 


In the ring Z[x] of all polynomials with integer coefficients, let J be the set of 
polynomials whose constant terms are even integers. Thus xX? + x + 6is in J, 
but 4x? + 3 is not. Verify that 7 is an ideal in Z[x] (Exercise 2). 


EXAMPLE 5 


Let T be the ring of all functions from R to R, as described in Example 8 
of Section 3.1. Let Z be the subset consisting of those functions g such that 
2(2) = 0. Then Z is a subring of T (Exercise 14 of Section 3.1). If fis any 
function in T and if gE J, then 


XD = f(2)g(2) = f(2) +0 = 0. 
Therefore, fg EI. Similarly, g f'€ I, so that is an ideal in T. 


EXAMPLE 6 


The subring Z of the rational numbers is not an ideal in Q because Z fails to 
; ? l 3 
have the absorption property. For instance, 2€ Q and 5EZ, but their product, 


Siy 
= t inZ. 
z is not in 


EXAMPLE 7 


Verify that the set J of all matrices of the form F “i with a, bE R forms a 


b 
subring of the ring M(R) of all 2 X 2 matrices over the reals. It is easy to see 


that J absorbs products on the /eft: 


F NG 4 a Tie et 
t uj\b 0 ta+ub 0 f 


But J is not an ideal in M(R) because it may not absorb products on the right—for 


m9 = Dar 


One sometimes says that J is a left ideal, but not a two-sided ideal, in M(R). 


The following generalization of Theorem 3.6 often simplifies the verification that a 
particular subset of a ring is an ideal. 
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Theorem 6.1 


A nonempty subset / of a ring R is an ideal if and only if it has these properties: 
(i) ifa, b e I, thena—b Ef; 
(ii) ifr e Randa e l, thenra e landar e /, 


Proof > Every ideal certainly has these two properties. Conversely, suppose I 
has properties (i) and (ii). Then J absorbs products by (ii), so we need 
only verify that Zis a subring. Property (i) states that 7 is closed under 
subtraction. Since J is a subset of R, the product of any two elements 
of I must be in Z by (ii). In other words, J is closed under multiplication. 
Therefore, Jis a subring of R by Theorem 3.6. E 


Finitely Generated Ideals 


In the first example of this section we saw that the set J of all multiples of 3 is an ideal 
in Z. This fact is a special case of 


Theorem 6.2 


Let R be a commutative ring with identity, c ER, and / the set of all multiples 
of c in R, that is, / = {rc |r ER}. Then / is an ideal. 


Proof» 1f ny f» FER and ne, r EI, then 
re- re= (r-rel and ric) =(rrjcel 


because r; — r, and rr, are elements of R. Similarly, since R is commuta- 
tive, (mer = (rrı)c E E. Therefore, Fis an ideal by Theorem 6.1. m 


The ideal Jin Theorem 6.2 is called the principal ideal generated by c and hereafter 
will be denoted by (c). In the ring Z, for example, (3) indicates the ideal of all multiples 
of 3. In any commutative ring R with identity, the principal ideal (1,) is the entire ring 
R because r = rlp for every re R. It can be shown that every ideal in Z is a principal 
ideal (Exercise 40). However, there are ideals in other rings that are not principal, that 
is, ideals that do not consist of all the multiples of a particular element of the ring. 


EXAMPLE 8 


We have seen that the set J of all polynomials with even constant terms is an 
ideal in the ring Z[x]. We claim that J is not a principal ideal. To prove this, 
suppose, on the contrary, that J consists of all multiples of some polynomial 
P(x). Since the constant polynomial 2 is in J, 2 must be a multiple of p(x). 
By Theorem 4.2, this is possible only if p(x) has degree 0, that is, if p(x) is a 
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constant, say p(x) = c. Since p(x) E J, the constant c must be an even integer. Since 
2is a multiple of p(x) = e, the only possibility is ¢ = +2. On the other hand, x EJ 
because it has even constant term 0. Therefore, x must be a multiple of p(x) = 
However, if +2g(x) = x, then g(x) has degree 1 by Theorem 4.2, say g(x) = ax + b. 
But +2(ax + b) = x implies that +2a = 1 because the coefficient of x must be the 
same on both sides. This is impossible because a is an integer. Therefore, J does not 
consist of all multiples of p(x) and is not a principal ideal. 


In a commutative ring with identity, a principal ideal consists of all multiples of a 
fixed element. Here is a generalization of that idea. 


Theorem 6.3 


Let R be a commutative ring with identity and c;, Cz, . . . , Ca ER. Then the set 
P= {F41 + Foco to + + fnCn | M4) Fo, «a < fa ER} is an ideal in R. 


Proof» Exercise 14. m 


The ideal Jin Theorem 6.3 is called the ideal generated by cı, c2,... , €„ and is 
sometimes denoted by (cj, ĉ2, . . . , &). Such an ideal is said to be finitely generated. A 
principal ideal is the special case n = 1, that is, an ideal generated by a single element.* 
The generators of a finitely generated ideal need not be unique, that is, the ideal gener- 
ated by c}, c,,...,¢, might be the same set as the ideal generated by d,, dz, . . . , dg, even 
though no c, is equal to any d; (Exercise 16). 


EXAMPLE 9 


In the ring Z[x], the ideal generated by the polynomial x and the constant poly- 
nomial 2 consists of all polynomials of the form 


fox + g(x)2, with f(x), g0) E Zi. 


It can be shown that this ideal is the ideal 7 of all polynomials with even 
constant term, which was discussed in Example 8 (Exercise 15). 


Congruence 
Now that you are familiar with ideals, we can define congruence in an arbitrary ring: 
Definition 


Let / be an ideal in aring Rand let a, b ER. Then ais congruentto b modulo 
I [written a = b (mod /)] provided that a — be/. 


*When a commutative ring does not have an identity, the ideal generated by c;, ĉe, ..., ¢, is defined 
somewhat differently (see Exercise 33). 
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Example | shows that congruence modulo 3 in the integers is the same thing as 
congruence modulo the ideal J, where J is the principal ideal (3) of all multiples of 3. 
Similarly, Example 2 shows that congruence modulo x? — 2 in Q[x] is the same as con- 
gruence modulo the principal ideal (x* — 2). Thus congruence modulo an ideal includes 
as a special case the concepts of congruence in Z and F[x] used earlier in this book. 


EXAMPLE 10 


Let T be the ring of all functions from R to R and let 7 be the ideal of all func- 
tions g such that g(2) = 0. If f(x) = x? + 6 and A(x) = 5x, then the function 
f— hisin I because 


(f — AY) = f(2) — h(2) = (2 + 6) — (5+ 2) = 
Therefore, f = h (mod J). 


Theorem 6.4 


Let / be an ideal in a ring R. Then the relation of congruence modulo / is 


(1) reflexive: a = a (mod /) for every a E R; 
(2) symmetric: if a = b (mod /), then b = a (mod /); 
(3) transitive: if a = b (mod /) and b = c (mod /), then a = c (mod /). 


This theorem generalizes Theorems 2.1 and 5.1. Observe that the proof is virtually 
identical to that of Theorem 2.1—just replace statements like “k is divisible by n” or 
“n|k” or “k = nt” with the statement “ke I”. 


Proof of Theorem 6.4 > (1) a — a= ORE I; hence, a= a (mod J). 

(2) a =b (mod J means that a — b = i for some ic F. Therefore, b — a = 
— (a — b) = —i. Since Tis an ideal, the negative of an element of J is also 
in J, and so b — a = —i € I. Hence, b =a (mod J). 

(3) If a = b (mod J and b = c (mod D, then by the definition of con- 
gruence, there are elements and J in Z such that a — b = i and b — c =j. 
Therefore, a — c = (a — b) + (b — c) = i + J. Since the ideal Jis closed under 
addition, i+ jE Tand, hence, a = c (mod J). E 


Theorem 6.5 
Let / be an ideal in a ring R. lf a = b (mod /) and c = d (mod /), then 


(1) a+c =b+d(mod /); 
(2) ac = bd (mod /). 
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This theorem generalizes Theorems 2.2 and 5.2. Its proof is quite similar to theirs 
once you make the change to the language of ideals. 


Proof of Theorem 6.5 » (1) By the definition of congruence, there are i, JE J such 
thata — b = i and c — d = j. Therefore, (a + c) — (b + d) = (a — b) + 
(c — d) =i + jEI Hence, a +c =b + d(mod TI). 


(2) ac — bd = ac — be + be — bd = (a — b)c + b(c — d) = ic + bj. Since 
the ideal J absorbs products on both left and right, ice Fand bje T. Hence, 
ac — bd = ic + bje I. Therefore, ac = bd (mod f). m 


If Tis an ideal in a ring Rand aE R, then the congruence class of a modulo / is the 
set of all elements of R that are congruent to a modulo 7, that is, the set 


{bE R|b = a (mod I} = {bE R|b -ae R 
= {bER|b- a=iwithieR 
= {bER|b=ati, withieR 
= {a+iljieh. 
Consequently, we shall denote the congruence class of a modulo J by the symbol a + I 
rather than the symbol [a] that was used in Z and F[x]. The plus sign in a + Fis just a 


formal symbol; we have not defined the sum of an element and an ideal. In this con- 
text, the congruence class a + Fis usually called a (left) coset of Jin R. 


Theorem 6.6 


Let / be an ideal in a ring R and let a,c © R. Then a = c (mod /) if and only 
ifat+i=ct+i. 


Proof > with only minor notational changes, the proof of Theorem 2.3 carnes 
over almost verbatim to the present case. Simply replace “mod n’” by “mod 
I” and “[a]” by “a + T’; use Theorem 6.4 in place of Theorem 2.1. m 


Corollary 6.7 


Let / be an ideal in a ring A. Then two cosets of / are either disjoint or identical. 
Proof Copy the proof of Corollary 2.4 with the obvious notational changes E 


If is an ideal in a ring R, then the set of all cosets of J (congruence classes modulo 7) 
is denoted R/T. 


EXAMPLE 11 


Let J be the principal ideal (3) in the ring Z. Then the cosets of J are just the 
congruence classes modulo 3, and so there are three distinct cosets: 0 + J = [0], 
1+J7=(1], and 2 + J = [2]. The set Z /I of all cosets is precisely the set Z, in 
our previous notation. 
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EXAMPLE 12 


Let I be the ideal in Z[x] consisting of all polynomials with even constant 
terms. We claim that Z[x]/J consists of exactly two distinct cosets, namely, 

0 + Jand 1 + J. To see this, consider any coset f(x) + I. The constant term of 
J (x)is either even or odd. If it is even, then f(x) € I, so that f(x) = 0 (mòd J). 
Therefore, f (x) + J = 0 + I by Theorem 6.6. If f (x) has odd constant term, 
then f(x) — 1 haseven constant term, so that f(x) = 1 (mod J). Thus f(x) + J= 
1 + Z by Theorem 6.6. 


EXAMPLE 13 


Let T be the ring of functions from R to R and let J be the ideal of all functions 
g such that g(2) = 0. Note that for each real number r, the constant function f, 
(whose rule is f(x) = r) is an element of T. Let A(x) be any element of T. Then 
h(2) is some real number, say A(2) = c, and 

(h — f)(2) = h(2) - f.(2) =c- = 0. 


Thus hk — f,€J, so that h = f, (mod J) and, hence, h + I = f, + I. Consequently, 
every coset of J can be written in the form f, + J for some real number r. 
Furthermore, if c # d, then £,(2) # fa(2), so that [fa — f4(2) # 0 andf, — fag I. 
Hence, f, # J; (mod J) and f, + I + f4 + I. Therefore, there are infinitely many dis- 
tinct cosets of J, one for each real number r. 


E Exercises 


NOTE: R denotes a ring. 


A. 1. Show that the set K of all constant polynomials in Z[x] is a subring but not an 
ideal in Z[x]. 


2. Show that the set J of all polynomials with even constant terms is an ideal in 


2[x]. 
3. (a) Show that the set J = {(k, 0) |k E Z} is an ideal in the ring Z Xx Z. 
(b) Show that the set T = {(k, k) |k € Z} is not an ideal in Z X Z. 


4. Is the set J = e a Jre R} an ideal in the ring M(R) of 2 X 2 matrices 
over R? 


5. Show that the set K = {(¢ i Ja, bER > isa subring of M(R) that absorbs 


products on the right. Show that K is not an ideal because it may fail to 
absorb products on the left. Such a set K is sometimes called a right ideal. 


6. (a) Show that the set of nonunits in Zg is an ideal. 


(b) Do part (a) for Zo. [Also, see Exercise 24.] 
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7. Let ce Rand let J = {re|re R}. 


(a) If R is commutative, prove that / is an ideal (that is, Theorem 6.2 is true 
even when R does not have an identity). 


(b) If Ris commutative but has no identity, is c an element of the ideal 7? 
[Hint: Consider the ideal {24 |k € E} in the ring E of even integers. Also see 
Exercise 33.] 


(c) Give an example to show that if R is not commutative, then J need not be 
an ideal. 


8. If Jis an ideal in R and J is an ideal in the ring S, prove that J X J is an ideal in 
the ring R X S. 


9. Let R be a ring with identity and let Tbe an ideal in R. 
(a) If 1REI, prove that J = R. 
(b) If Z contains a unit, prove that J = R. 
10. If J is an ideal in a field F, prove that J = (Og) or J = F. [Hint: Exercise 9.] 
11. List the distinct principal ideals in each ring: 
(a) Zs (b)Zo (€)Zi2 
12. List the distinct principal ideals in Z, X Z3. 


13. If R is a commutative ring with identity and (a) and (b) are principal ideals 
such that (a) = (b), is it true that a = b? Justify your answer. 


14. Prove Theorem 6.3. 


15. Show that the ideal generated by x and 2 in the ring Z [x] is the ideal J of all 
polynomials with even constant terms (see Example 9). 


16. (a) Show that (4, 6) = (2) in Z, where (4, 6) is the ideal generated by 4 and 6 
and (2) is the principal ideal generated by 2. 


(b) Show that (6, 9, 15) = (3) in Z. 
17. (a) If Zand J are ideals in R, prove that 7 N J is an ideal. 


(b) If [4] is a (possibly infinite) family of ideals in R, prove that the 
intersection of all the J, is an ideal. 


18. Give an example in Z to show that the set theoretic union of two ideals may 
not be an ideal (in fact, it may not even be a subring). 


19. If J is an ideal in R and S is a subring of R, prove that JM S is an ideal in S. 


20. Let J and J be ideals in R. Prove that the set K = {a + blael, bE J} isan 
ideal in R that contains both J and J. K is called the sum of Jand Jand is 
denoted J + J. 


21. If dis the greatest common divisor of a and b in Z, show that (a) + (b) = (d). 
(The sum of ideals is defined in Exercise 20.) 


22. Let J and J be ideals in R. Is the set K = {ab |a E I, bE J} an ideal in R? 
Compare Exercise 20. 
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23. 


B.24. 


25. 


26. 


27. 


28. 


33. 


Ideals and Quotient Rings 


(a) Verify that J = {0, 3} is an ideal in Z, and list all its distinct cosets. 
(b) Verify that J = {0, 3, 6,9, 12} is an ideal in Z} and list all its distinct cosets. 


Let R be a commutative ring with identity, and let N be the set of nonunits in 
R. Give an example to show that N need not be an ideal. 


Let J be an ideal in R. Prove that J is an ideal, where 
I= {rE R|rt = 0g for every te J}. 
Let J be an ideal in R. Prove that K is an ideal, where 
K = {ae R|rae/ for every re R}. 
Let f:R — S be a homomorphism of rings and let 
K = {reRIf(r) = 05}. 
Prove that K is an ideal in R. 


If I is an ideal in R, prove that J[x] (polynomials with coefficients in J) is an 
ideal in the polynomial ring R[x]. 


. If (m, n) = 1 in Z, prove that (m) N (n) is the ideal (mn). 


. Prove that the set of nilpotent elements in a commutative ring R is an ideal. 


[Hint: See Exercise 44 in Section 3.2.] 


. Let R be an integral domain and a, be R. Show that (a) = (b) if and only if 


a = bu for some unit uE R. 


. (a) Prove that the set J of all polynomials in Z [x] whose constant terms are 


divisible by 3 is an ideal. 
(b) Show that J is not a principal ideal. 


Let R be a commutative ring without identity and let a E R. Show that 
A = {ra + na|rE R,nEZ} is an ideal containing a and that every ideal 
containing a also contains A. A is called the principal ideal generated by a. 


34. If M is an ideal in a commutative ring R with identity and if a€ R with æ ¢ M, 
prove that the set 
J={m+ra|reRandmeM} 
is an ideal such that M G J. 
35. Let I be an ideal in Z such that (3) ¢ J¢ Z. Prove that either J = (3) or J= Z. 


36. 


37. 


38. 


Let Jand J be ideals in R. Let ZJ denote the set of all possible finite sums of 
elements of the form ab (with a E I, bE J), that is, 


IJ = {abı + aba +--- + a,b, | n2z1,¢,6], beJ}. 
Prove that JJ is an ideal, JJ is called the product of 7 and J. 


Let R be a commutative ring with identity 1p # Og whose only ideals are 
(Og) and R. Prove that R is a field. [Hint: If a # Og, use the ideal (a) to find a 
multiplicative inverse for a.] 
Let / be an ideal in a commutative ring R and let 

J = {re R|r € Ifor some positive integer n}. 
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Prove that J is an ideal that contains 7. [Hint: You will need the Binomial 
Theorem from Appendix E. Exercise 30 is the case when J = (Qg).] 


39. (a) Show that the ring M(R) is not a division ring by exhibiting a matrix that 
has no multiplicative inverse. (Division rings are defined in Exercise 42 of 
Section 3.1.) 


(b) Show that M(R) has no ideals except the zero ideal and M(R) itself. 
[Hint: If Jis a nonzero ideal, show that J contains a matrix A witha 
nonzero entry c in the upper left-hand corner. Verify that 


1 0 ct 0 1 0 F F ae 
= AZEN 0 = 2 m and that this matrix is in J. Similarly, 


show that (o `) is in J. What is their sum? See Exercise 9.] 


40. Prove that every ideal in Z is principal. [Hint: If J is a nonzero ideal, show that 
I must contain positive elements and, hence, must contain a smallest positive 
element c (Why?). Since ¢ EI, every multiple of ¢ is also in J; hence, (c) ¢ T. 

To show that J C (e), let a be any element of I. Then a = ceq + r with0 sr < c¢ 
(Why?). Show that r = 0 so that a = cq E(c).] 


41. (a) Prove that the set S of rational numbers (in lowest terms) with odd 
denominators is a subring of Q. 


(b) Let J be the set of elements of S with even numerators. Prove that J is an 
ideal in S. 


(c) Show that S/I consists of exactly two distinct cosets. 


42. (a) Let p be a prime integer and let T be the set of rational numbers (in lowest 
terms) whose denominators are not divisible by p. Prove that T is a ring. 


(b) Let J be the set of elements of T whose numerators are divisible by p. 
Prove that J is an ideal in T. 


(c) Show that T/I consists of exactly p distinct cosets. 
43. Let J be the set of all polynomials with zero constant term in Z[>]. 
(a) Show that J is the principal ideal (x) in Z[x]. 


(b) Show that Z[x]/J consists of an infinite number of distinct cosets, one for 
eachn eZ. 


44. (a) Prove that the set T of matrices of the form i J with a, beRisa 
subring of M(R). a 


(b) Prove that the set Jof matrices of the form i 0) with bER is an ideal 
in the ring T. 


(c) Show that every coset in T/J can be written in the form if °) +0. 


(Copyrigiv 2012 Cengage Lemming. AR Rights Caamved May not ba copied, exceed or dnpficmatt in whote or ia parl Dus to dlactronic dgis, mms third pasty contest may be mppmmai fren the eBook endfor eChagear(s). Edimrial review has 
deemed that any cappreses4 aatan doaa oct metadaNy «fisct the overt Learning axperiamre Cangege Learning maacves the right 0) ramova efidixtonel cute! af easy tne if sthencpemt tights cestrictines require it. 


152 Chapter6 Ideals and Quotient Rings 


45. (a) Prove that the set S of matrices of the form G “ with a, b, ee Risa 
subring of M(R). A 


0 5b). ; $ 

(b) Prove that the set Jof matrices of the form ( a with b ER is an ideal 
in the ring S. 0 

(c) Show that there are infinitely many distinct cosets in S/I, one for each pair 
nR xR. 


C.46. Let F be a field. Prove that every ideal in F[x] is principal. [Hint: Use the 
Division Algorithm to show that the nonzero ideal Jin F[x] is (p(x)), where 
P(x) is a polynomial of smallest possible degree in /.] 


47. Prove that a subring S of Z, has an identity if and only if there is an element u 
in S such that u? = u and S is the ideal (x). 


Ea Quotient Rings and Homomorphisms 


We now show that the set of congruence classes modulo an ideal is itself a ring. As you 
might expect, this is a straightforward generalization of what we did with congruence 
classes in Z and F[x]. However, you may not have expected these rings of congruence 
classes to have close connections with some topics studied in Chapter 3, isomorphisms 
and homomorphisms. These connections are explored in detail and provide new insight 
into the structure of rings. 

Let J be an ideal in a ring R. The elements of the set R/T are the cosets of J (con- 
gruence classes modulo J), that is, all sets of the form a + J = {a + i| i€ I}. In order 
to define addition and multiplication of cosets as we did with congruence classes in Z 
and F[x], we need 


Theorem 6.8 


Let / be an ideal in a ring R. Ifa +/=b+/ande+/=d+/inR/I, then 
(a+c)+/=(b+d)+} and act/=bd+i. 
Proof > This is a generalization of Theorem 2.6, in slightly different notation. 


Replace “[a]” by “a + I” and copy the proof of Theorem 2.6, using 
Theorems 6.5 and 6.6 in place of Theorems 2.2 and 2.3. E 


We can now define addition and multiplication in R/J just as we did in #,, and 
F[x]/(p(@x)): The sum of the coset a + J (congruence class of a) and the coset e + I 
(congruence class of c) is the coset (a + c) + J (congruence class of a + c). In symbols, 


(atDt+(Ce+t+DHD=H@totrl 
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This statement may be a bit confusing because the plus sign is used with three entirely 
different meanings: 

as a formal symbol to denote a coset: a + J; 

as an operation on elements of R: a + c; 

as the addition operation on cosets that is being defined.* 
The important thing is that, because of Theorem 6.8, coset addition is independent 
of the choice of representative elements in each coset. Even if we replace a + 7 by an 
equal coset b + I and replace c + J by an equal coset d +7, the resulting coset sum, 
namely (b + d) + J, is the same as (a + e) + I. 


Multiplication of cosets is defined similarly and is independent of the choice of 
representatives by Theorem 6.8: 


(a+ D(c+D=act+1. 


EXAMPLE 1 


If J is the principal ideal (3) in Z, then addition and multiplication of cosets is 
the same as addition and multiplication of congruence classes in Section 2.2. 
Thus Z/Tis just the ring Z,. 


EXAMPLE 2t 


If Fis a field, p(x) is a polynomial in F[x], and J is the principal ideal (p(x)), 
then cosets of J are precisely congruence classes modulo p(x), so that addition 
and multiplication of cosets are done exactly as they were in Section 5.2. Thus 
F[x]/Tis the congruence-class ring F[x]/(p(x)). 


EXAMPLE 3 


Let J be the ideal of polynomials with even constant terms in Z[x]. As we saw 
in Example 12 of Section 6.1, Z[x]/I consists of just two distinct cosets, 0 + I 
and 1 + I. We have (1 +7) + (1 +7)=(1+1)+I=2 +I but2E/, so that 
2 = 0 (mod }) and, hence, 2 + J = 0 + T. Similar calculations produce the 
following tables for Z[x]/T. It is easy to see that Z[x]/T is a ring (in fact, a field) 
isomorphic to Z,: 


+ O+7 141 © | O+I 147 
O+T FE T+T O | OE OFE 
1+7 | 1+7 0+ 1+7 | 0+7 147 


*This ambiguity can be avoided by using a different notation for cosets, such as [a], and a different 
symbol for coset addition, such as @. The notation above is customary, however, and once you're 
used to it, there should be no confusion. 


tSkip this example if you have not read Chapter 5. 
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These examples illustrate the following theorem, which should not be very surpris- 
ing in view of your previous experience with Z and F[x]. 


Theorem 6.9 


Let / be an ideal in a ring R. Then 


(1) R// is a ring, with addition and multiplication of cosets as defined 
previously. 

(2) If R is commutative, then R// is a commutative ring. 

(3) If R has an identity, then so does the ring R//. 


Proof» (1) With the usual change of notation (“a + I” instead of “[a]”), the 
proof of Theorem 2.7 carries over to the present situation since that 
proof depends only on the fact that Z isa ring. Don’t take our word for 
it, though; write out the proof in detail for yourself. 

(2) If R is commutative and a, c E R, then ac = ca. Consequently, in 
R/Twe have(a +I)(e+ I) =ac+ I= ca + I= (c+ Ia + I). Hence, 
R/Tis commutative. 

(3) The identity in R/J is the coset 1p + I because (a + D(lp +) = 
alr + f= a+TJand similarly (lg + Da@at+N=at+l B 


The ring R/T is called the quotient ring (or factor ring) of R by £ One sometimes 
speaks of factoring out the ideal F to obtain the quotient ring R/T. 


Homomorphisms 


Quotient rings are the natural generalization of congruence-class arithmetic in Z and 
F[x]. As is often the case in mathematics, however, a concept developed with one idea 
in mind may have unexpected linkages with other important mathematical concepts. 
That is precisely the situation here. We shall now see that the concept of homomor- 
phism that arose in our study of isomorphism of rings in Chapter 3 is closely related 
to ideals and quotient rings. 


Definition Let f:R —> 5 be a homomorphism of rings. Then the kernel of f is the set 
= {rER| f(r) = 0,}. 


Thus, the kernel of f is the subset of R consisting of those elements of R that 
f maps to Os in S. Note that Og is in the kernel since f(0pg) = 0; by Theorem 3.10. 
However, the kernel may also contain nonzero elements. 
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EXAMPLE 4 


In Example 6 of Section 3.3 we saw that the function f:Z — Z, defined by 

f(r) = [r] E Z is a homomorphism of rings. Its kernel K contains many nonzero 
integers. For instance, 12 € K because f(12) = [12] = [0] in Zs. In fact every 
multiple of 6 is in the kernel because 


= {reZ| f) = 0} = {reZ |i = [03 WPefiniriona A 
= {re Z|r=0(mod6)} [Theorem 2.3] 
= {reZ|ó|r} [Definition of congruence mod 6) 
= fall multiples of 6} [6 | r means r is a multiple of 6]. 


So the kernel K is the principal ideal (6) in Z. 


EXAMPLE 5 


The function @:R[x] — R that sends each polynomial in R[x] to its constant 
term in R is a ring homomorphism (Exercise 1). Its kernel consists of all 
polynomials with constant term 0. But every polynomial with 0 constant term 
is divisible by x. So the kernel is the principal ideal (x) in R[x]. 


Examples 4 and 5 provide examples of the following theorem. 


Theorem 6.10 


Let f:R + S be a homomorphism of rings. Then the kernel K of f is an ideal in 
the ring A. 


Proof»We shall use Theorem 6.1 to show that K = {rE R| f(r) = 0s} is an ideal. 
We must verify that is a nonempty subset of R that is closed under sub- 
traction and absorbs products. First, Kis nonempty because 0, E K as 
noted before Example 4. To prove that Kis closed under subtraction, we 
must show that for a, b E K, the element a — b is also in K. To show 
a — b E K, we must show that f(a — b) = Os. This follows from the fact 
that fis ahomomorphism and that f(a) = Os and f(b) = Os (because a, 
be K): 


Ja — b) = f(a) — f(b) = 0s — 0s = Os. 


To prove that K absorbs products we must first verify that ra © K for any 
re Rand ae XK, that is, that f(ra) = 0s; here’s the proof: 


flra) = Aa) = f) 0s = 


A similar argument shows that ar € K. Therefore K is an ideal by 
Theorem 6.1. B 
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In Examples 4 and 5, the kernel of the homomorphism contained many nonzero 
elements. Sometimes, however, the kernel of a homomorphism contains only Og, in 
which case we have an interesting result. 


Theorem 6.11 


Let f:R + S be a homomorphism of rings with kernel K. Then K = (Op) if and 
only if fis injective. 


Proof» Suppose that K = (0g). We must show that f is injective, so assume 
that a, b € R and f(a) = f(b). Because f is a homomorphism, 
Jla — b) = f(a) — f(b) = Os. Hence, a — b isin the kernel K = (0p), 
which means that a — b = Og and a = b. Therefore f is injective. 

Conversely, suppose f is injective. If c € K, we must show that c = Op. 

By the definition of the kernel, f(c) = 0s. By Theorem 3.10, f(Oz) = 05 = 
fle). Therefore, c = Og because f is injective. Hence, the kernel consists 
of the single element Og, that is, K = (0p). E 


EXAMPLE 6 


In Example 7 of Section 3.3 we saw that the function g:R — M(R) given by 
g(r) = ( s A is a ring homomorphism. Its kernel of g consists of all real 
ar 


0 0 0 0 
numbers r such that g(r) = i= a that is, such that ( ) = ( i 
0 0 hn OF 0 0 


This can only occur when r = 0. So the kernel is the zero ideal (0). Hence, g is 
injective by Theorem 6.11. 


Theorem 6.10 states that every kernel is an ideal. Conversely, every ideal is the 
kernel of a homomorphism: 


Theorem 6,12 


Let / be an ideal in a ring R. Then the map 7:R > R// given by a(r} =r + lis 
a surjective homomorphism with kernel /. 


The map 7 is called the natural homomorphism from R to R/T. 


Proof of Theorem 6.12 » The map 7 is surjective because given any coset r + Jin 
R/T, w(r) = r + I. The definition of addition and multiplication in R/T 
shows that 7 is a homomorphism: 

mr +s =(r+s)+I= (r+) +s+ I) = alr) + 79); 
mlrs) =rs + I= (r+ ID(s + I) = T(r) als). 
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The kernel of 7 is the set of elements rE R such that a(r) = Og + T 

(the zero element in R/Z). However, m(r) = Og + Jif and only if r + I = 
Oz + J, which occurs if and only if r = 0g (mod J), that is, if and only if 
rE I. Therefore, Tis the kernel of 7. m 


The natural homomorphism 7 in Theorem 6.12 is a special case of a more general 
situation. If fR — S is a surjective homomorphism of rings, we say that S is a 
homomorphic image of R. If fis actually an isomorphism (so that S is an isomorphic 
image of R), then we know that R and S have identical structure. Whenever one 
of them has a particular algebraic property, the other one has it too. If fis not an 
isomorphism, then properties of one ring may not hold in the other. However, the 
properties of S and the homomorphism f often give us some useful information 
about R. An analogy with sculpture and photography may be helpful: If fR + S 
is an isomorphism, then S is an exact, three-dimensional replica of R. If fis only a 
surjective homomorphism, then S is a two-dimensional photographic image of R in 
which some features of R are accurately reflected but others are distorted or missing. 
The next theorem tells us precisely how R, S, and the kernel of fare related in these 
circumstances. 


Theorem 6.13 First lsomorphism Theorem 


Let £R — S be a surjective homomorphism of rings with kernel K. Then the 
quotient ring R/K is isomorphic to S. 


The theorem states that every homomorphic image of a ring R is isomorphic to a 
quotient ring R/K for some ideal K. Thus if you know all the quotient rings of R, then 
you know all the possible homomorphic images of R. The ideal K measures how much 
information is lost in passing from the ring R to the homomorphic image R/K. When 
K = (0g), then fis an isomorphism by Theorem 6.11, and no information is lost. But 
when K is large, quite a bit may be lost. 


Proof of Theorem 6.13 » We shall define a function g from R/K to S and then 
show that it is an isomorphism. To define œ, we must associate with 
each coset r + K of R/K an element of S. A natural choice for such an 
element would be f(r) E S; in other words, we would like to define 
g:R/K > S by the rule g(r + K) = f(r). The only possible problem is that 
a coset can be labeled by many different elements of R. So we must show 
that the value of @ depends only on the coset and not on the particular 
representative r chosen to name it. If r + K = t + K, thenr = t (mod 
K) by Theorem 6.6, which means that r — t E K by the definition of 
congruence. Consequently, since f is a homomorphism, f(r) — f(z) = 
f(r — t) = Og. Therefore, r + K = t + K implies that f(r) = f(£). It 
follows that the map ©:R/K — S given by the rule g(r + K) = f(r) isa 
well-defined function, independent of how the coset is written. 
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If se S, then s = f(r) for some rE R because fis surjective. Thus 
s = f(r) = g(r + K), and ¢ is surjective. To show that ¢ is injective, we 
assume that g(r + K) = (c + K) and show that r + K = c + K, as follows: 


elr + K) = (c + K) 
SO =fO [Definition of ¢] 


f(r) — fe) = Os 
f(r — e) = Os. Lf is a homomorphism.] 


Thus, r — ce Kand hence, r = c (mod K). Sor + K=c+ Kby 
Theorem 6.6. Therefore, ¢ is injective. 
Finally, ¢ isa homomorphism because fis 


gl(c + Kd + K)] = pled + K) = fled) = NAA) 
= (c + K)g(d + K) 


and 


l(c + K) + (d + K)] = gf(c + d) + K] = f(c + d) = f(e) + Ka) 
= (c + K)+9(d+ K). 


Therefore, p:R/K — S is an isomorphism. W 


The First Isomorphism Theorem is a useful tool for determining the structure of 
quotient rings, as illustrated in the following examples. 


EXAMPLE 7 


In the ring Z[x], the principal ideal (x) consists of all multiples of x, that is, 

all polynomials with constant term 0. What does the quotient ring Z[x]/(x) 
look like? We can answer the question by using the function §:Z[x] > Z, 

which maps each polynomial to its constant term. The function 6 is certainly 
surjective because each k €Z is the image of the polynomial x + k in Z[x]. 
Furthermore, 0 isa homomorphism of rings (Exercise 1). The kernel of 6 
consists of all those polynomials that are mapped to 0, that is, all polynomials 
with constant term 0. Thus the kernel of @ is the ideal (x). By Theorem 6.13 the 
quotient ring £[x]/(x) is isomorphic to Z. 


EXAMPLE 8 


Let T be the ring of functions from R to R and 7 the ideal of all functions 

g such that g(2) = 0. In Example 13 of Section 6.1 we saw that T/F con- 
sists of the cosets f, + J, one for each real number r, where f;:R — R is the 
constant function given by f,(x) = r for every x. This suggests the possibility 
that the quotient ring T/F might be isomorphic to the field R. We shall use 
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Theorem 6.13 to show that this is indeed the case by constructing a surjective 
homomorphism from T to R whose kernel is the ideal J. Let ¢:T — R be the 
function defined by o(f) = /(2). Then @ is surjective because for every real 
number r, r = f,(2) = o(f,). Furthermore, o is a homomorphism of rings: 


o(f + h) = (f + AY2) = f(2) + K2) = off) + GA) 
Sh) = (AAD = f(2)h(2) = APH. 
By definition, the kernel of ¢ is the set 
{gET|¢(g) = 0} = {gE T|g(2) = 03. 
Thus the kernel is precisely the ideal 7. By Theorem 6.13, 7//Jis isomorphic to R. 


EXAMPLE 9 


What do the homomorphic images of the ring Z look like? To answer this 
question, suppose that f:Z — S is a surjective homomorphism. If f is actually 
an isomorphism, then S looks exactly like Z, of course (in terms of algebraic 
structure). If fis surjective, but not an isomorphism (that is, not injective), then 
the kernel K of fis a nonzero ideal in Z by Theorem 6.11. Since K is an ideal 

in Z, K must be a principal ideal, say K = (n) for some n # 0, by Exercise 40 

in Section 6.1, By Theorem 6.13, S is isomorphic to Z/K = £/(n) = Z„ Thus 
every homomorphic image of Z is isomorphic either to Z or to Z, for some n. 


E Exercises 


A. 1. Show that the map @:R[x] > R that sends each polynomial f(x) to its constant 
term is a surjective homomorphism. 


2. Show that every homomorphic image of a field F is isomorphic either to F 
itself or to the zero ring. [Hint: See Exercise 10 in Section 6.1 and Exercise 7 
below.] 


3. If Fis a field, R a nonzero ring, and f:F— R a surjective homomorphism, 
prove that f is an isomorphism. 


4. Let [a], denote the congruence class of the integer a modulo n. 


(a) Show that the map /:7,, > Z, that sends [a];; to [a], is a well-defined, 
surjective homomorphism. 


(b) Find the kernel of f. 


5. Let J be an ideal in an integral domain R. Is it true that R/J is also an integral 
domain? 


6. The function g:R[x] > R given by ¢( f(x)) = f(2) is a homomorphism of 
rings by Exercise 24 of Section 4.4 (with a = 2). Find the kernel of o. [Hint: 
Theorem 4.16.] 
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7. 
8. 


10. 


11. 


16. 


17. 


18. 


19. 


Ideals and Quotient Rings 


If Ris a ring, show that R/(0g) = R. 


Let R and S be rings. Show that 7:R X S —> R given by (r, s) = risa 
surjective homomorphism whose kernel is isomorphic to S. 


. R= {( ) |a, b,cE z} is a ring with identity by Example 19 
c 
in Section 3.1. 
(a) Show that the map f:R — Z given by 403 a = ais a surjective 
homomorphism. É 


(b) What is the kernel of f? 


(a) Let f:R —> S be a surjective homomorphism of rings and let I be an ideal 
in R. Prove that f(T) is an ideal in S, where f(T) = {s © S|s = f(a) for 
some aE Ñ}. 


(b) Show by example that part (a) may be false if fis not surjective. 


Z[V2]is a ring by Exercise 13 of Section 3.1. Let f:2[V2] > Z[ V2] be the 
function defined by f(a + bV2) = a — bV2. 


(a) Show that fis a surjective homomorphism of rings. 


(b) Use Theorem 6.11 to show that fis also injective and hence is an 
isomorphism. [You may assume that ‘V2 is irrational] 


. Let J be an ideal in a noncommutative ring R such that ab — ba E I for all 


a, bE R. Prove that R/F is commutative. 


. Let J be an ideal in a ring R. Prove that every element in R/J has a square root 


if and only if for every a E R, there exists b € R such that a — b'e I. 


. Let J be an ideal in a ring R. Prove that every element in R/J is a solution of 


x? = x if and only if for every aE R, a* — aE TI. 


. Let Z be an ideal in a commutative ring R. Prove that R/J has an identity if 


and only if there exists e € R such that ea — ac I for every aE R. 


Let J # R be an ideal in a commutative ring R with identity. Prove that R/J is 
an integral domain if and only if whenever abe J, either aE I or bel. 


Suppose J and J are ideals in a ring R and let f:R — R/I X R/J be the 
function defined by f(a) = (a + Lat J). 


(a) Prove that fis a homomorphism of rings. 
(b) Is f surjective? [Hint: Consider the case when R = Z, I = (2), J = (4).] 
(c) What is the kernel of f? 


Let R be a commutative ring with identity with the property that every ideal 
in R is principal. Prove that every homomorphic image of R has the same 
property. 

Let Zand K be ideals in a ring R, with K & I. Prove that I/K ={a + Kja EI} is 
an ideal in the quotient ring R/K. 


(Copyright 2012 Cangage Learning AN Gights Ramrvat May not be pied, eama oc duplicate, in whale or ta part Dus to electronic cights, maa third perty Cutest mey be axppmmad Gran the eBook ador eChageer(s). Ediawial exview has 
Ged that coy Capen) atat does not materially affect the oveni earning axpetare Cagea Learning masrves the right to save efifiriconel cutest of aay tune if subaya tipit» cemrictirms require it. 


6.2 Quotient Rings and Homomorphisms 161 


20. Let f:R — S be a homomorphism of rings with kernel K. Let J be an ideal 
in R such that ZS K. Show that f:R/I > S given by f(r + J) = f(r) is a well- 
defined homomorphism. 


21. Use the First Isomorphism Theorem to show that Z29/(5) = Zs. 


22. Let f:R — S be a homomorphism of rings. If J is an ideal in S and J = 
{re R| f(r) € J], prove that Tis an ideal in R that contains the kernel of f. 


23. (a) Let Rbea ring with identity. Show that the map f:Z — R given by 
f(k) = klp is a homomorphism. 


(b) Show that the kernel of fis the ideal (n), where n is the characteristic of 
R. [Hint: “Characteristic” is defined immediately before Exercise 41 of 
Section 3.2. Also see Exercise 40 in Section 6.1.] 


24. Find at least three idempotents in the quotient ring Q[x]/(x* + x’). 
[See Exercise 3 in Section 3.2.] 


25. Let R be a commutative ring and J the ideal of all nilpotent elements of R 
(as in Exercise 30 of Section 6.1). Prove that the quotient ring R/J has no 
nonzero nilpotent elements. 


26. Let Sand [be as in Exercise 41 of Section 6.1. Prove that S/I = Z}. 

27. Let T and Ibe as in Exercise 42 of Section 6.1. Prove that T/I = Z,. 

28. Let T and J be as in Exercise 44 of Section 6.1. Prove that T/I = R. 

29. Let S and I be as in Exercise 45 of Section 6.1. Prove that S/I = R x R. 


C. 30. (The Second Isomorphism Theorem) Let J and J be ideals in a ring R. Then 
IO Jisan ideal in J, and J is an ideal in J + J by Exercises 19 and 20 of 
I 25 
Section 6.1. Prove tha- = = i [Hint: Show that f:I —> (J + J)/J given 


by f(a) = a + Jisa surjective ae with kernel ZN J.) 


31. (The Third Isomorphism Theorem) Let J and K be ideals in a ring R such that 
KGI Then J/K isan ideal in R/K by Exercise 19. Prove that (R/K)/(/K) = 
R/T. [Hint: Show that the map f:R/K > R/I given by f(r + K) =r + Iis a well- 
defined surjective homomorphism with kernel 7/K.] 


32. (a) Let K be an ideal in a ring R. Prove that every ideal in the quotient 
ring R/K is of the form J/K for some ideal Jin R. [Hint: Exercises 19 
and 22.] 


(b) If #:R — Sis a surjective homomorphism of rings with kernel K, prove 
that there is a bijective function from the set of all ideals of S to the set of 
all ideals of R that contain K. [Hint: Part (a) and Exercise 10.] 


EXCURSION: The Chinese Remainder Theorem for Rings 


(Section 14.3) may be covered at this point if desired. 
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63 The Structure of R/I When / Is Prime or Maximal* 


Quotient rings were developed as a natural generalization of the rings Z, and F[x]/(p(x)). 
When p is prime and p(x) irreducible, then Z, and F[x]/ (p(x)) are fields. In this section 
we explore the analogue of this situation for quotient rings of commutative rings. We 
shall determine the conditions necessary for a quotient ring to be either an integral 
domain or a field. 

Primes in Z and irreducibles in F[x] play essentially the same role in the structure 
of the congruence class rings. Our first task in arbitrary commutative rings is to find 
some reasonable way of describing this role in terms of ideals. According to Theorem 1.5, 
a nonzero integer p (other than +1) is prime if and only if p has this property: 
Whenever p | bc, then p |b or p |c. To say that p |a means that ais a multiple of p, that 
is, a is an element of the principal ideal (p) of all multiples of p. Thus this property of 
primes can be rephrased in terms of ideals: 


If p + 0, +1, then pis prime if and only if 
whenever bc E (p), then b E (p) or cE (p). 


The condition p + +1 guarantees that 1 is not a multiple of p and, hence, that the ideal 
(p) is not all of Z. Using this situation as a model, we have this 


Definition An ideal P inacommutative ring Ris said to be prime if P # Rand whenever 


be EP, thenbePorceP. 


EXAMPLE 1 


As shown above, the principal ideal (p) is prime in Z whenever p is a prime 
integer. On the other hand, the ideal P = (6) is not prime in Z because 
2°3eP but 2¢ Pand3¢P. 


EXAMPLE 2 


The zero ideal in any integral domain R is prime because ab = Og implies 
a = Ogor b = Opr 


EXAMPLE 3 


The implication (1) = (2) of Theorem 4.12 shows that if F is a field and p(x) is 
irreducible in F[x], then the principal ideal (p(x)) is prime in FL]. 


“This section is not used in the sequel and may be omitted if desired. 
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EXAMPLE 4 


Let I be the ideal of polynomials with even constant terms in Z[x]. Then Fis not 
principal (Example 8 of Section 6.1) and clearly 7+ Z[x]. Let f(x) = a,x" + +++ + a 
and g(x) = 5,,x” + +++ + By be polynomials in Z[x] such that f(2) g(x) €I. Then the 
constant term of /(x)g(x), namely ağ, must be even. Since the product of two odd 
integers is odd, we conclude that either ay is even (that is, f(x) € J) or dy is even (that 
is, g(x) € J). Therefore, J is a prime ideal. 


The ideal J in Example 4 is prime, and the quotient ring Z[x]/J is a field (see 
Example 3 of Section 6.2). Similarly, Z/(p) = Z, is a field when p is prime. However, 
the next example shows that R/P may not always be a field when P is prime. 


EXAMPLE 5 


The principal ideal (x) in the ring Z[x] consists of polynomials that are mul- 
tiples of x, that is, polynomials with zero constant terms. Hence, (x) # Z[x]. If 
Jx) = a + +++ + aand g(x) = 5 x™ + +++ + by and f(x)g(x) E, then the 
constant term of /(x)g(x), namely ao, must be 0. This can happen only if 

= Oor by = 0, that is, only if f(x) € (x) or g(x) E (x). Therefore, (x) is a prime 
ideal. However, Example 7 of Section 6.2 shows that the quotient ring Z[x]/(x) 
is isomorphic to Z. Therefore, Z[x]/(x) is an integral domain but not a field. 


In light of Example 5, the next theorem is the best we can do with prime ideals. 


Theorem 6.14 


Let P be an ideal ina commutative ring R with identity. Then P is a prime ideal 
if and only if the quotient ring R/P is an integral domain. 


Proof » 1f P is any ideal in R, then by Theorem 6.6, a + P = 0p + P in R/Pif 
and only if a = 0g (mod P). Furthermore, a = Og (mod P) if and only if 
ac P. So we have this useful fact: 


(#) a + P= 0r + Pin R/P if and only if aeP. 


Suppose P is prime. By Theorem 6.9, R/P is a commutative ring 
with identity. In order to prove that R/P is an integral domain, we must 
show that its identity is not the zero element and that it has no zero 
divisors. Since P is prime, P # R. Consequently, 1p P because any ideal 
containing lg must be the whole ring. However, 1,¢ P implies that 
1p + P #0, + Pin R/P by (+). Now we show that R/P has no zero 
divisors. If (b + P)(c + P) = Og + P, then bc + P = Op + P and bc EP 
by (+). Hence bE P or ce E Thus b +P = Og + Pore + P= 0g + P, so 
that R/P has no zero divisors. Therefore R/P is an integral domain. 
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Now assume that R/P is an integral domain. Then by definition 
lr + P # Op + Pand hence 1p¢ P by (+). Therefore P + R. To complete 
the proof that P is prime we assume that bc € P and show that bE P or 
cE P. Nowif bce P, then in R/P we have (b + P)(e+ P) = be + P= 
Or + P by (+). Thus b + P = Og + P or c + P = Opg + P because R/ P has 
no zero divisors. Hence b € P or c E€ P by (+). Therefore P is prime. W 


Since the quotient ring modulo a prime ideal is not necessarily a field, it is natural 
to ask what conditions an ideal must satisfy in order for the quotient ring to be a field. 


EXAMPLE 6 


Consider the ideal (3) in Z. We lnow that Z/(3) = Z, is a field. Now consider 
the ideal (3). Suppose J is an ideal such that (3) = J&Z. If J + (3), then there 
exists a € J with a¢ (3). In particular, 3 ¥ a, so that 3 and a are relatively prime. 
Hence, there are integers u and v such that 3u + av = 1. Since 3 and a are in 
the ideal J, it follows that 1 € J. Therefore J = Z by Exercise 9 of Section 6.1, 
and so there are no ideals strictly between (3) and Z. 


EXAMPLE 7 


The quotient ring Z[x]/(x) is not a field (Example 5). Furthermore, the ideal 7 
of polynomials with even constant terms lies strictly between (x) and Z[x], that 
is, (x) G IG Z[x]. 


Here is a formal definition of the property suggested by these examples: 


Definition An ideal M in a ring R is said to be maximal if M # Rand whenever d is an 
ideal such that Mc /CR, then M=Jord=R. 


Example 6 shows that the ideal (3) is maximal in Z and Example 7 shows that the 
ideal (x) is not maximal in Z[x]. Note that a ring may have more than one maximal 
ideal. The ideal {0, 2, 4} is maximal in Ze, and so is the ideal {0, 3}. There are infinitely 
many maximal ideals in Z (Exercise 3). Maximal ideals provide the following answer 
to the question posed above: 


Theorem 6,15 


Let M be an ideal in a commutative ring R with identity. Then M is a maximal 
ideal if and only if the quotient ring R/M is a field. 


Proof» We shall use the same fact that was used in the proof of Theorem 6.14: 


e) a+M=0r+MinR/M _ if andonlyif aeM. 


Crai 2012 Cees Lames: AX Ripley Gaard Oty a be copie jeta or daptic aan, in whale or ia part Dus to elecrrunic dghts, anme third party coment may te mppmaad fran the «Block eaicr eChapar(a). Edimrial seview bas 
‘het eny Upped crates does not materiaDy affect the oveni laruing axpariare Cangage l amming Marva the right to cea? etiicionel ciast at any tune if suhag rights murite Teque it. 


6.3 The Structure of RII When / Is Prime or Maximal 165 


Suppose R/M is a field. Then by definition 1p + M + Og + Mand 
hence 1p Æ M by (+). Therefore M + R. To show that M is maximal, we 
assume that J is an ideal with M S JS R and show that M = Jor 
J = R. If M = J, there is nothing to prove. If M # J, then there exists 
acJwitha¢M. Hence a + M + 0g + Min the field R/M, and a + M 
has an inverse b + M such that (a + M)(b + M)=ab+ M=1p+M. 
Then ab = 1p(mod M) by Theorem 6.6, so that ab — 1p = m for some 
mE M. Thus lp = ab — m. Since a and mare in the ideal J, it follows 
that 1pEJ and J = R. Therefore M is a maximal ideal. 

Now assume is a maximal ideal in R. By Theorem 6.9, R/Mis acom- 
mutative ring with identity. In order to prove that R/M is a field, we first 
show that its identity is not the zero element. Since M is mawmal, M + R. 
Consequently, 1x ¢ M because any ideal containing 1 p must be the whole 
ring. However, 1p M implies that 1p + M # Og + Min R/M by (+). 

Next we show that every nonzero element of R/M has a multiplicative 
inverse. If a + M is a nonzero element of R/M, then a ¢ M (otherwise a + M 
would be the zero coset). The set 


J= {m + ra|reRandmeM) 


is an ideal in R that contains M by Exercise 34 of Section 6.1. Furthermore, 
a = Ôg + lgaisin J, so that M + J. By maximality we must have J = R. 
Hence 1R EJ, which implies that 1p = m + ca for some mE M and cER. 
Note that ca — 1p = —m E M, so that ca = 1p (mod M), and hence 

ca + M=1p+ M by Theorem 6.6. Consequently, the coset 

c + Mis the inverse of a + Min RJM: 


(c+ MYa+ M)=ca+ M=1p+M. 


So every nonzero element of R/M is a unit (Axiom 12 is satisfied). 
Therefore, R/M is a field. W 


Corollary 6.16 


In a commutative ring R with identity, every maximal ideal is prime. 


Proof» If M is a maximal ideal, then R/M is a field by Theorem 6.15. Hence, 
R/M is an integral domain by Theorem 3.8. Therefore, M is prime by 
Theorem 6.14. E 


Theorem 6.15 can be used to show that several familiar ideals are maximal. 


EXAMPLE 8 


The ideal 7 of polynomials with even constant terms in Z[x] is maximal because 
2[x]/T is a field (see Example 3 of Section 6.2). 
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Ideals and Quotient Rings 


EXAMPLE 9 


Let T be the ring of functions from R to R and let J be the ideal of all functions 
g such that g(2) = 0. In Example 8 of Section 6.2 we saw that 7/Tis a field 
isomorphic to R. Therefore, J is a maximal ideal in T. 


E Exercises 


A. 1. 
2. 


If n is a composite integer, prove that (7) is not a prime ideal in Z. 


If R is a finite commutative ring with identity, prove that every prime ideal in 
R is maximal. [Hint: Theorem 3.9.] 


. (a) Prove that a nonzero integer p is prime if and only if the ideal (p) is 


maximal in Z. 


(b) Let F bea field and p(x) € F[x1. Prove that p(x) is irreducible if and only if 
the ideal (p(x)) is maximal in F[x]. 


. Let R be a commutative ring with identity. Prove that R is an integral domain 


if and only if (0p) is a prime ideal. 


5. List all maximal ideals in Z,. Do the same in Z)). 


6. (a) Show that there is exactly one maximal ideal in Zg. Do the same for Z4. 


7. 


14. 


[Hint: Exercise 6 in Section 6.1.] 
(b) Show that Z, and Z,; have more than one maximal ideal. 


Let R be a commutative ring with identity. Prove that R is a field if and only if 
(Og) is a maximal ideal. 


. Give an example to show that the intersection of two prime ideals need not be 


prime. [Hint Consider (2) and (3) in Z.] 


. Let R be an integral domain in which every ideal is principal. If (p) is a 


nonzero prime ideal in R, prove that p has this property: Whenever p factors, 
p = ed, then c or disa unit in R. 


. Let p be a fixed prime and let J be the set of polynomials in Z[x] whose 


constant terms are divisible by p. Prove that J is a maximal ideal in 7[x]. 


. Show that the principal ideal (x — 1) in Z[x] is prime but not maximal. 


. If p is a prime integer, prove that M is a maximal ideal in Z X Z, where M = 


{(pa, b) |a, be Z}. 


. If Z is an ideal in a ring R, then J X Tis an ideal in R X R by Exercise 8 of 


Section 6.1. Prove that (R X R)/(I X I) is isomorphic to R/I X R/T. 
[Hint: Show that the function f:R X R—» R/I X R/I given by f(a, 5)) = 
(a + I, b + I) is a surjective homomorphism of rings with kernel J X 1] 


If P is a prime ideal in a commutative ring R, is the ideal P X P a prime ideal 
in R X R? [Hint- Exercise 13.] 
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15. (a) Let R be the set of integers equipped with the usual addition and 
multiplication given by ab = 0 for all a, b E R. Show that Risa 
commutative ring. 

(b) Show that M = {0, +2, +4, +6, . . . } is a maximal ideal in R that is not 
prime. Explain why this result does not contradict Corollary 6.16. 
16. Show that M = {0, +4, +8, . . . } is a maximal ideal in the ring £ of even 


integers but E/M is not a field. Explain why this result does not contradict 
Theorem 6.15. 


17. Let f:R — S bea surjective homomorphism of commutative rings. If J is a 
prime ideal in S, and J= {r E R | f(r) € J}, prove that J is a prime ideal in R. 

18. Let P be an ideal in a commutative ring R with P # R. Prove that P is prime 
if and only if it has this property: Whenever 4 and B are ideals in R such that 
ABG P, then A € Por B&P. [AB is defined in Exercise 36 of Section 6.1. This 
property is used as a definition of prime ideal in noncommutative rings.] 

19. Assume that when R is a nonzero ring with identity, then every ideal of 
Rexcept R itself is contained in a maximal ideal (the proof of this fact is 
beyond the scope of this book). Prove that a commutative ring R with identity 
has a unique maximal ideal if and only if the set of nonunits in R is an ideal. 
Such a ring is called a local ring. (See Exercise 6 of Section 6.1 for examples of 
local rings.) 


20. Find an ideal in Z X Z that is prime but not maximal. 
C.21. (a) Prove that R = {a + bila, b EZ} is a subring of C and that 
M = {a + bi|3|a and 3] 5} 


is a maximal ideal in R. [Hint: If r + si¢ M, then 3 Xr or 3 4s. Show 
that 3 does not divide r? + 3 = (r + si)(r — si). Then show that any ideal 
containing r + si and M also contains 1.] 


(b) Show that R/M is a field with nine elements. 


22. Let R be as in Exercise 21. Show that J is not a maximal ideal in R, where J = 
{a + bi|5|aand 5 |b}. [Hint: Consider the principal ideal K = (2 + 4) in R] 


23. If R and J are as in Exercise 22, show that R/J = Z; X Zs. 
24. If R and K areas in Exercise 22, show that R/K = Zs. 


. Prove that T = {a + bV2\a,bEZ} is a subring of Rand M = 
{a + bV2|5|a and 5|b} is a maximal ideal in T. 


N 
A 


ALTERNATIVE ROUTES: At this point there are three possibilities. 
You may explore a new algebraic concept, groups (Chapter 7)—if you 
have not already done so—or continue further with either integral 
domains (Chapter 10) or fields (Chapter 11). 
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CHAPTER 7 


Groups 


The algebraic systems with which you are familiar, such as Z, Z,, the rational 
numbers, the real numbers, and other rings all have two operations: addition and 
multiplication. In this chapter, we introduce a different kind of algebraic structure— 
called a group—that uses a single operation. Groups arise naturally in the study of 
symmetry, geometric transformations, algebraic coding theory, and in the analysis 
of the solutions of polynomial equations. 


ALTERNATE ROUTE: If you have not read Chapter 3 (Rings), you 
should replace Section 7.1 with Section 7.1.A, which begins on page 183. 


cag Definition and Examples of Groups 


A group is an algebraic system with one operation. Some groups arise from rings by 
ignoring one of their operations and concentrating on the other. As we shall see, for 
example, the integers form a group under addition (but not multiplication) and the 
nonzero rational numbers form a group under multiplication (but not addition). But 
many groups do not arise from a system with two operations. The most important of 
these latter groups (the ones that were the historical starting point of group theory) 
developed from the study of permutations * Consequently, we begin with a consider- 
ation of permutations. 

Informally, a pernmtation of a set Tis just an ordering of its elements. For example, 
there are six possible permutations of T = {1, 2, 3}: 


123 132 213 231 312 321% 


*In the early nineteenth century, permutations played a key role in the attempt to find formulas for 
solving higher-degree polynomial equations similar to the quadratic formula. For more information, 
see Chapter 12. 
169 
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Each such ordering determines a bijective function from T to T; map 1 to the first 
element of the ordering, 2 to the second, and 3 to the third.* For instance, 2 3 1 
determines the function f: T— T whose rule is f(1) = 2; f(2) = 3; f(3) = 1. Conversely, 
every bijective function from T to T defines an ordering of the elements, namely, f (1), 
Ff (2), {(3). Consequently, we define a permutation of a set T to be a bijective function 
from T to T. This definition preserves the informal idea of ordering and has the advan- 
tage of being applicable to infinite sets. For now, however, we shall concentrate on finite 
sets and develop a convenient notation for dealing with their permutations. 


EXAMPLE 1 


Let T = {1, 2, 3}. The permutation f whose rule is f(1) = 2, f(2) = 3, f(3) = 1 


123 
may be represented by the array G 3 4) in which the image under fof an 


element in the first row is listed immediately below it in the second row. Using 
this notation, the six permutations of T are 


Ca ee collet 
r 2 3JWN. 8 YN a 3 

Cx G Tang i) 

2 3y awe Tae a y 
Since the composition of two bijective functions is itself bijective, the composi- 
tion of any two of these permutations is one of the six permutations on the list 

: : 12 
above. For instance, if f = G 2 2 and g = ( ; a) then f° g is the func- 
tion given by 2 
(fe) =f(e@) =fQ) =2 


(f° 82) = fE) =f) = 3 
(f° 8X3) =f(g3)) =£G) = 1. 


123 
Thus f° g = $ 3 i) It is usually easier to make computations like this 


by visually tracing an element’s progress as we first apply g and then f; for 
example, 


fa gee 
TO a mat 


*Bijective functions are discussed in Appendix B. 
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If we denote the set of permutations of T by S}, then composition of functions 
(e°) is an operation on the set S, with this property: 


If fE S; and g E S; then fo gE S3. 
Since composition of functions is associative,* we see that 


(f°g)°h =fe(goh) for all f, g, h E S3. 


t23 
Verify that the identity permutation J = ( ) has this property: 


123 


Iof=f and foI=f_ foreveryfEsS,. 


Every bijection has an inverse function;* consequently, 


if fE Sz, then there exists g € S, such that 
fee =] and gof= L 


1 
For instance, if f = G í a) then g = G ; i) because 


wE eet co 
3 tf awe & 4 1 2 3 
and 
e | ees F 
a 3 Dw 1 By N O BP 


You should determine the inverses of the other permutations in S, (Exercise 1). 
Finally, note that fo g may not be equal to g ° f; for instance, 


Goa de Tees | 
ae 2. | 2 | 3 2.3) 1l 


but 


*See Appendix B. 
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By abstracting the key properties of $, under the operation œ, we obtain this 


Definition A group is a nonempty set G equipped with a binary operation + that satis- 
fies the following axioms: 

1. Closure: Ifae G and beG, thenaxbeG. 

2. Associativity: a x(b«c) = (ax 6) «efor alla, b, ceG. 


3. There is an element ee G (called the identity element) such that 
axe=a=e-aforeveryacG, 


4, For each ae G, there is‘an element de G (called the inverse of a) 
such thataxd =eandd*a=e, 
A group is said to be abelian‘ if it also satisfies this axiom: 


5. Commutativity: a +b = b x aforalla, bes. 


A group Gis said to be finite (or of finite order) if it has a finite number of elements. 
In this case, the number of elements in G is called the order of G and is denoted [G]. A 
group with infinitely many elements is said to have infinite order. 


EXAMPLE 2 


The discussion preceding the definition shows that S, is a nonabelian group of 
order 6, with the operation + being composition of functions. 


EXAMPLE 3 


The permutation group S; is just a special case of a more general situation. Let 
n be a fixed positive integer and let T be the set {1, 2, 3, . . . , n}. Let S, be the set 
of all permutations of T (that is, all bijections T— T). We shall use the same 


notation for such functions as we did in S$. In Sç, for instance, K z : v 


denotes the permutation that takes 1 to 4, 2 to 6, 3 to 2, 4 to 3, 5 to 5, and 6 to 
1. Since the composite of two bijective functions is bijeçtive, S, is closed under 
the operation of composition. For example, in S, 
) p ( 123 43 6 
645 m r3 


a 
pe 


„* Sei 
` 
` 


peren pas 
+ 2 
Bde. SF ME A 


ore 
NE T o O ae 
iina AEN E M i 


(Remember that in composition of functions, we apply the right-hand function 
first and then the left-hand one. in this case, for instance, 4 + 3 > 2, as shown 


tBinary operations are defined in Appendix B. 
tin honor of the Norwegian mathematician N. H. Abel (1802-1829). 
§See Appendix B. 
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by the arrows.) We claim that S, is a group under this operation. Composition 
of functions is known to be associative, and every bijection has an inverse func- 
tion under composition.t It is easy to verify that the identity permutation 


123 
group on # symbols, The order of S, is n! = n(n — 1)(# — 2)... 2.1 (Exercise 20). 


EXAMPLE 4 


123 n 
( ati 4 is the identity element of S,. S, is called the symmetric 


The preceding example is easily generalized. Let T be any nonempty set, possi- 
bly infinite, Let A(T) be the set of all permutations of T (all bijective functions 
T— T). The arguments given above for S, carry over to A(T) and show that 

A(T) is a group under the operation of composition of functions (Exercise 12). 


EXAMPLE 5 


Think of the plane as a sheet of thin, rigid plastic. Suppose you cut out a square, 
pick it up, and move it around,* then replace it so that it fits exactly in the cut-out 
space. Eight ways of doing this are shown below (where the square is centered at 
the origin and its corners numbered for easy reference). We claim that any mo- 
tion of the square that ends with the square fitting exactly in the cut-out space 
has the same result as one of these eight motions (Exercise 14). 


All Rotations Are Taken Counterclockwise Around the Center 


fy = rotation of 0° 


4 


2 
rı = rotation of 90° 


tSee Appendix B for details. 
tFlip it, rotate it, turn it over, spin it, do whatever you want, as long as you don’t bend, break, or distort it. 
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n = rotation of 180° 


2 4 


r, = rotation of 270° 


d = reflection in the x-axis 


(Copyright 2012 Omage Lemming AN Rights Ramrved. May not ba opts, crane, or dapticaned, in whale or ia part Dus to electronic dgh», mma third party contest may de mppamai fren the eBook zadur eChagmar(s). Edixriai erview bas 
Ged hat any capyesesd costed! does oct mammiaity fisc the crani larning anpeiame Camgegy Learning aves the rigit to save atkficional costed! at eay time if athemypuems igin cemrictirms require it. 


7.14 Definition and Examples of Groups 175 


h = reflection in line y = x 


If you perform one of these motions and follow it by another, the result will be 
one of the eight listed above; for example, 


If you think of a motion asa function from the square to itself, then the idea of follow- 
ing one motion by another is just composition of functions In the illustration above 
(h followed by r is £, we can wnte r; ° A = t (remember r; ° h means first apply A, then 
apply rı). Verify that the set 


D, = {ro Fis F2 F3, h, v, d, B 
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equipped with the composition operation has this table: 


o wo h Pe m Gd ih t w 
fro lo Ty fo ry d K t g 
w (zi t rs m hk t w d 
ro |r Bh fo n t v d h 
nh |n o nrn v d h t 
d |d vv t h wm ry % NA 
k |b dv t mH my B N 
t t h d unh hh 
v » t h d m gi w % 


Clearly D, is closed under °, and composition of functions is known to be associative. 
The table shows that rọ is the identity element and that every element of D, has an 
inverse, For instance, r3°r, = ro = r; ° r3 Therefore, D; is a group. It is not abelian 
because, for example, h o d # d ° h. D, is called the dihedral group of degree 4 or 
the group of symmetries of the square. 


EXAMPLE 6 


The group of symmetries of the square is just one of many symmetry groups. 

An analogous procedure can be carried out with any regular polygon of n sides. 
The resulting group D, is called the dihedral group of degree 2. The group Ds, for 
example, consists of the six symmetries of an equilateral triangle (counterclockwise 
rotations about the center of 0°, 120°, and 240°; and the three reflections shown 
here), with composition of functions as the operation: 


3 3 3 2 
Yo ti 
a i 
I I I 3 
2 1 
3 1 7 
ta s 
i PE ge 
I 2 1 T 
2 3 2214752 Instructor Instructoß 


3 1 3 3 


Symmetry groups arise frequently in art, architecture, and science. 
Crystallography and crystal physics use groups of symmetries of various 
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three-dimensional shapes. The first accurate model of DNA (which led to the 
Nobel Prize for its creators) could not have been constructed without a recogni- 
tion of the symmetry of the DNA molecule. Symmetry groups have been used by 
physicists to predict the existence of certain elementary particles that were later 
found experimentally. 


Groups and Rings 


A ring R has two associative operations, and it is natural to ask if R is a group under 
either one. For addition the answer is yes: 


Theorem 7.1 


Every ring is an abelian group under addition. 


Proof An examination of the first five axioms for a ring (in Section 3.1) shows that 
they are identical to the five axioms for an abelian group, with the operation * 
being +, the identity element e being 0p, and the inverse of a being -a. E 


EXAMPLE 7 


By Theorem 7.1, each of the following familiar rings is an abelian group under 
addition: 


Z, Z„ Q, R, C; 
Matrix rings, such as M(R) and M(Z2); 
Polynomial rings such as Z[x], R[x], and Z,[x]. 


Hereafter, when we use the word “group” without any qualification in referring 
to these or other rings, it is understood that the operation is addition. 


Multiplication, however, is a different story: 
A nonzero ring R is never a group under multiplication. 


If R has no identity, Axiom 3 fails. If R has an identity, then Og has no inverse and 
Axiom 4 fails. Nevertheless, certain subsets of a ring with identity may be groups 
under multiplication. 


Theorem 7.2 


The nonzero elements of a field F form an abelian group under multiplication. 


Hereafter we shall denote the set of nonzero elements in a field F by F*. 


Proof of Theorem 7.2 > Multiplication in F* satisfies the following ring axioms: 
6 and 11 (closure), 7 (associativity), 10 (identity), 12 (inverses), and 9 
(commutativity}—see pages 44, 48, and 49. So F™ satisfies group axioms 
1-5 and, therefore, is an abelian group under multiplication. W 
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EXAMPLE 8 


Theorem 7.2 shows that each of the following is an abelian group under 
multiplication: 


Q* the nonzero rational numbers; R* the nonzero real numbers; 


€* the nonzero complex numbers. 


EXAMPLE 9 


If p is prime, then Z, is a field by Theorems 2.7 and 2.8. Therefore, Z,* is a 
group under multiplication by Theorem 7.2. 


EXAMPLE 10 


The positive rational numbers Q** form an infinite abelian group under multi- 
plication, because the product of positive numbers is positive, 1 is the identity 
element, and the inverse of a is 1/a. Similarly, the positive reals R** form an 
abelian group under multiplication. 


EXAMPLE 11 


The subset {1, —1, i, —#} of the complex numbers forms an abelian group of 
order 4 under multiplication. You can easily verify closure, and | is the identity 
element. Since i(— i) = 1, i and —i are inverses of each other; —1 is its own 
inverse since (—1)(-1) = 1. Hence, Axiom 4 holds. 


EXAMPLE 12 


Neither the nonzero integers nor the positive integers form a group under mul- 
tiplication. Although 1 is the multiplicative identity for each system, no integers 
except for +1 have a multiplicative inverse, so Axiom 4 fails. For example, the 
equation 2x = 1 has no integer solution, so 2 has no inverse under multiplica- 
tion in the integers. 


EXAMPLE 13 


When n is composite, the nonzero elements of Z, do not form a group under 
multiplication because (among other things) closure fails. In Z,, for instance, 
2-3 =Oand in Zy, 4° 5 = 0. Similarly if n = rs, then in Z,, rs = 0. 


A ring R with identity always has at least one subset that is a group under 
multiplication. Recall that a unit in R is an element a that has a multiplicative inverse, 
that is, an element u such that au = lp = ua. 
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Theorem 7.3 


If R is a ring with identity, then the set U of all units in R is a group under 
multiplication.* 


Proof» The product of units is a unit (Exercise 15 in Section 3.2), so U is closed under 
multiplication (Axiom 1). Multiplication in R is associative, so Axiom 2 holds. 
Since 1, is obviously a unit, U has an identity element (Axiom 3). Axiom 4 
holds in U by the definition of unit. Therefore, Visa group @ 


EXAMPLE 14 


Denote the multiplicative group of units in Z, by U,. According to Theorem 2.10, 
U, consists of all a € Z, such that (a, n) = 1 (when a is considered as an ordinary 
integer). Thus the group of units in Z, is Ug = {1, 3, 5, 7}, and the group of units 
in Z,5is U,; = {1, 2, 4, 7, 8, 11, 13, 14}. Here is the operation table for U;: 


7 


N UW me 
YU U eje 
U N e Ww] Ww 
wo e N MJU 
e WwW UN N 


EXAMPLE 15 


Examples 7 and 10 of Section 3.2, and Exercise 17 of Section 3.2 show that the 
group of units in M(R) is 


GL(2, R) = {( 7) | where a,b, c,d € R and ad — be + o}, 


which is called the general linear group of degree 2 over R. It is an infinite 
nonabelian group (Exercise 7). 


EXAMPLE 16 


Examples 8 and 10 of Section 3.2, and Exercise 17 of Section 3.2 show that the 
group of units in M(@,) is 


GL(2, Z,) = (e ’) 


the general linear group of degree 2 over Z2. It is a nonabelian finite group of 
order 6 (Exercise 7). 


where a, b,c, d € Z, and ad — bc + o}, 


*Theorem7.2is a special case of Theorem 7.3 because the units in a field are the nonzero elements. 
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New Groups from Old 


The Cartesian product, with operations defined coordinatewise, allowed us to con- 
struct new rings from known ones. The same is true for groups. 


Theorem 7.4 


Let G (with operation +) and H (with operation o} be groups. Define an operation = 
on G XH by 


(g, h)=(g', h’) = (geg' hob’). 


Then G x H isa group. If G and H are abelian, then so is G x H. If G and H 
are finite, then so is G X Hand |G x H|=|G||H} 


Proof» Exercise 26. m 


EXAMPLE 17 


Both Z and Z, are groups under addition. In Z xX Z, we have (3, 5) = (7, 4) = 
(3 +7, 5 + 4) = (10, 3). The identity is (0, 0), and the inverse of (7, 4) is ( —7, 2). 


EXAMPLE 18 


Consider R* X D,, where R* is the multiplicative group of nonzero real num- 
bers. The table in Example 5 shows that 

(2, n)", v) E (2 k 9, ri ov) = (18, d). 
The identity element is (1, ro), and the inverse of (8, r4) is (1/8, r). 


E Exercises 


A. 1. Find the inverse of each permutation in S}. 


2. Find the multiplicative inverse of each nonzero element in 


(a) Z; (b) Zs (c) Z; 
3. What is the order of each group: 
(a) Zig (b) Dy (©) S4 (d) S; (e) Us 


4. Determine whether the set G is a group under the operation +. 
(a) G = {2, 4, 6, 8} in Zio; a * b = ab 
(b) G=Zjaxb=a-b 
(c) G = {nEZ]|nis odd};a+b=a +b 
(d) G = {2* | xE Q}; a x b = ab 
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5. Find the inverse of the given group element. [Hint: Example 8 in Section 3.2— 
or Example 16 in Section 7.1.A—-and Exercise 2.] 


z OQ}. 1 2\. Dri 
(> Jez oG Jez oG Jrz 


6. Give an example of an abelian group of order 4 in which every nonidentity 
element a satisfies a + a = e. [Hint: Consider Theorem 7.4.] 


7. (a) Show that the group GL(2, Z,) has order 6 by listing all its elements. 
(b) Show by example that the groups GL(2, R) and GL(2, Z,) are nonabelian. 
8. Use Theorem 2.10 to list the elements of each of these groups: U4, Us, Uio» 
Un, Ur. 
9. Write out the operation table for the group D; described in Example 6. 


10. Show that G = {( 4 4 
-b a 


matrix multiplication. 


a, b E R, not both of is an abelian group under 


11. Consider the additive group Z, and the multiplicative group L = {+1, + i} of 
complex numbers. Write out the operation table for the group Z, X L. 


12. Let T be a nonempty set and A(T) the set of all permutations of T. Show that 
A(T) is a group under the operation of composition of functions. 


13. Give examples of nonabelian groups of orders 12, 16, 30, and 48. 
[Hint: Theorem 7.4 may be helpful.] 


B. 14. Show that every rigid motion of the square (as described in the footnote at the 
beginning of Example 5) has the same result as an element of D4. [Hint: The 
position of the square after any motion is completely determined by the location 
of corner 1 and by the orientation of the square—face up or face down.] 


15. Write out the operation table for the symmetry groups of the following figures: 


(a) (b) (c) i E | 


16. Let 1, i, j, k be the following matrices with complex entries: 


of 3 0 3 HG d G8 


(a) Prove that 
P=p=kK=-1 ij = —ji=k 
jk = -kj =i i = —ik =j. 
(b) Showthatset Q = {1,i, —1, —i, j, k, —j, —k} is a group under matrix 


multiplication by writing out its multiplication table. Q is called the 
quaternion group. 
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17. If Gis a group under the stated operation, prove it; if not, give a counterexample: 
(a) G=Q;atb=at+b+3 
(b) G= {re Q|r + 0};a% b= ab/3 

18. Let K = {rE R]|r # 0, r + 1}. Let G consist of these six functions from K to K: 


fe) == alx)=2—* Moi 
x 
x-1 


i(x) =x Kx) =1-—x k(x) = 


Is Ga group under the operation of function composition? 


19. Do the nonzero real numbers form a group under the operation given by a + b = 
|a|, where | a| is the absolute value of a? 


20. Prove that S, has order n!. [Hint: There are n possible images for 1; after one 
has been chosen, there are n — 1 possible images for 2; etc.] 


21. Suppose G is a group with operation +. Define a new operation # on G by 
a # b = b + a. Prove that G is a group under #. 


22. List the elements of the group D; (the symmetries of a regular pentagon). 
[Hint: The group has order 10.] 

23. Let SL{2, R) be the set of all 2 X 2 matrices is a such that a, b, c, dE R 
and ad — be = 1. Prove that SL(2, R) is a group under matrix multiplication. 
It is called the special linear group. 


24. Prove that the set of nonzero real numbers is a group under the operation * 


defined by 
Heeb Bs ab ifa>0 
ajb ifa<0. 
25. Prove that R* x R is a group under the operation + defined by (a, b) + (c, d) = 
(ac, be + d). 


26. Prove Theorem 7.4. 
27. If ab = ac ina group G, prove that b = e. 


28. Prove that each element of a finite group G appears exactly once in each row 
and exactly once in each column of the operation table. [Hint: Exercise 27.] 


29. Here is part of the operation table for a group G whose elements are a, b, c, d. 
Fill in the rest of the table. [Hint: Exercises 27 and 28.] 


la b c d 


a 
ala 
by | b> a 
e |e 
d|d 
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30. A partial operation table for a group G = {e, a, b, c, d, f} is shown below. 
Complete the table. [Hint: Exercises 27 and 28.] 


e b et dd? f 
e |e a c d f 
ala b e d 
b| b 
ole f a 
d | d 
| 


31. Let T be a set with at least three elements. Show that the permutation group 
A(T) (Exercise 12) is nonabelian. 


32. Let T be an infinite set and let A(T) be the group of permutations of T 
(Exercise 12). Let M = {fE A(T)| f(t) + t for only a finite number of tE T}. 
Prove that M is a group. 


33. If a, bER with a + 0, let T,,: — R be the function given by T,,(x) = ax + b. 
Prove that the set G = {7,,,|a, b ER with a # 0} forms a nonabelian group 
under composition of functions. 


34. Let H = {Tı} |b E R} (notation as in Exercise 33). Prove that H is an abelian 
group under composition of functions. 


C.35. If fE Sp prove that f*=-J for some positive integer k, where f* means 
fefefe --- of (k times) and Tis the identity permutation. 


36. Let G = {0, 1, 2,3, 4, 5, 6, 7} and assume Gis a group under an operation * 
with these properties: 
(i) axb<sa +b foralla, bE G; 
(ii) a xa = 0 foralla EG. 
Write out the operation table for G. [Hint: Exercises 27 and 28 may help.] 


7.1.8 Definition and Examples of Groups 


NOTE: If you have read Section 7.1, omit this section and begin Section 7.2. 


A group is an algebraic system with one operation. Some groups arise from familiar 
systems, such as Z, Z,, the rational numbers, and the real numbers, by ignoring one 
of their operations and concentrating on the other. As we shall see, for example, the 
integers form a group under addition (but not multiplication) and the nonzero ratio- 
nal numbers form a group under multiplication (but not addition). But many groups 
do not arise from a system with two operations. The most important of these latter 
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groups (the ones that were the historical starting point of group theory) developed 
from the study of permutations.* Consequently, we begin with a consideration of 
permutations. 

Informally, a permutation of a set Tis just an ordering of its elements. For example, 
there are six possible permutations of T = {1, 2, 3}: 


123 132 213 231 312 32L 


Each such ordering determines a bijective function from T to T: map 1 to the first 
element of the ordering, 2 to the second, and 3 to the third.f For instance, 2 3 1 de- 
termines the function f: T — T whose rule is f (1) = 2; f (2) = 3;/(3) = 1. Conversely, 
every bijective function from T to T defines an ordering of the elements, namely, 
SM: f(2); f (3). Consequently, we define a permutation of a set T to be a bijective 
function from T to T. This definition preserves the informal idea of ordering and 
has the advantage of being applicable to infinite sets. For now, however, we shall 
concentrate on finite sets and develop a convenient notation for dealing with their 
permutations. 


EXAMPLE 1 
Let T = {1, 2, 3}. The permutation f whose rule is f (1) = 2, f (2) = 3, f@) = 1 


may be represented by the array ( A 7 in which the image under fof an 


element in the first row is listed immediately below it in the second row. Using 
this notation, the six permutations of T are 


JG 3 a) 1 
JG i 2G 2 a): 


Since the composition of two bijective functions is itself bijective, the composi- 
tion of any two of these permutations is one of the six permutations on the list 


above. For instance, if f = é k + and g = G ; i) then fo g is the func- 
tion given by 


nS 
= m 
V N NN 


(f° gM) = fe) = S2) = 2 
(Fe gX2) = f(e(2)) =S) = 3 
(f°8X3) = f(g) =G) =L 


*In the early nineteenth century, permutations played a key role in the attempt to find formulas for 
solving higher-degree polynomial equations similar to the quadratic formula. For more information, 
see Chapter 12. 

tBijective functions are discussed in Appendix B. 


aie es Ses arp wh a eat regen Rp A eT a ae Dus to dacrtnic cights, mma shard party eamest may be appmeat fren the eBook eatV/or eChapeey(a). Editorial review hes 
that any capgresned cose? does Oot nateielty afec the orem turning apeme Cengage Leaning maven the right to rave ekfticn contast at any thie if sthemyunnt rights cestricticms require Ht. 


7.1.A Definition and Examples of Groups 185 


Thus fo g = ( ; a It is usually easier to make computations like this 


by visually acing an element’s progress as we first apply g and then f; for example, 


ra ` 


(Ft )-039 


If we denote the set of permutations of T by S}, then composition of functions 
(e )is an operation on the set S, with this property: 


If fE S, and g E S, then f ° g E S}. 
Since composition of functions is associative,* we see that 


(Jeg)eh=fo(geh)  forallf, g, hess. 
: z ' a t23 : 
Verify that the identity permutation 7 = 123 has this property: 


Iof=f and fol=f  forevery fes, 
Every bijection has an inverse function;* consequently, 
if f€ S}, then there exists g E€ S, such that 
feg=I and gof=I. 


For instance, if f = ~ A -i then g = ( i 3) because 


Et en ese a 
‘1.0 SHS 3 Ud 2 4 
> (ee F OaG a 
232 Y wiy Uga 


You should determine the inverses of the other permutations in S, (Exercise 1). 
Finally, note that f° g may not be equal to g° f: for instance, 


Ae tt Ae 
22 aw tee a l 
H as ii 
Siw Gay G i & 


and 


but 


*See Appendix B. 
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By abstracting the key properties of S, under the operation °, we obtain this 


Definition A group is- a nonempty set : equipped with a binary operation « that 


satisfies the following axiomst: 


1. Closure: Ifae G and beG, thena *be6G. 

2. Associativity: a » (b «c} = (a * b) *c for all a, b, cE G, 

3. There is an element ee G (called the identity element) such that 
a +e =a=exaforeveryacG. 


4. For each aeG, there ts an element de G (called the inverse of a) 
such thata xd = e and d xa =e, 


A group is said to be abelian* if it also satisfies this axiom: 
5. Commutativity: a +b = b xaforalla, beG. 


A group Gis said to be finite (or of finite order) if it has a finite number of elements. 
In this case, the number of elements in Gis called the order of G and is denoted |G}. A 
group with infinitely many elements is said to have infinite order. 


EXAMPLE 2 


The discussion preceding the definition shows that S, is a nonabelian group of 
order 6, with the operation + being composition of functions. 


EXAMPLE 3 


The permutation group S; is just a special case of a more general situation. Let 
n bea fixed positive integer and let T be the set {1, 2, 3, . . . , n}. Let S, be the set 
of all permutations of T (that is, all bijections T —> T). We shall use the same 


23456 
notation for such functions as we did in $3. In Se, for instance, P 6 35 i 


denotes the permutation that takes 1 to 4, 2 to 6, 3 to 2, 4 to 3, 5 to 5, and 6 to 
1. Since the composite of two bijective functions is bijective; S, is closed under 
the operation of composition. For example, in S6 


FAE ia 


if 2 4 4 5 Je ( 2, 3 i 5 aoip 23 4 5 s) 
3 #5 2 4 1 6 6 4 2 3 5 1 6 452 1 3 
ow" As 


` 
rii e a 


Aon a ENA o O O aAa 
Tn ee le men ee - E i 


tBinary operations are defined in Appendix B. 
tin honor of the Norwegian mathematician N. H. Abel (1802-1829). 
§See Appendix B for details. 
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(Remember that in composition of functions, we apply the right-hand function 
first and then the left-hand one. In this case, for instance, 4 + 3 > 2, as shown 
by the arrows.) We claim that S, is a group under this operation. Composition 
of functions is known to be associative, and every bijection has an inverse func- 
tion under composition. It is easy to verify that the identity permutation 


4 : i = >) is the identity element of S,„. S» is called the symmetric 


group on n symbols. The order of S, is n! = n(n — 1)(n — 2)... 2.1 (Exercise 20). 


EXAMPLE 4 


The preceding example is easily generalized. Let T be any nonempty set, 
possibly infinite. Let A(T) be the set of all permutations of T (all bijective 
functions T—> T). The arguments given above for S, carry over to A(T) and 
show that A(T) is a group under the operation of composition of functions 
(Exercise 12). 


EXAMPLE 5 


Think of the plane as a sheet of thin, rigid plastic. Suppose you cut out a 
square, pick it up, and move it around, then replace it so that it fits exactly in 
the cut-out space. Eight ways of doing this are shown below (where the square 
is centered at the origin and its corners numbered for easy reference). We claim 
that any motion of the square that ends with the square fitting exactly in the 
cut-out space has the same result as one of these eight motions (Exercise 14). 


All Rotations Are Taken Counterclockwise Around the Center. 


1 = rotation of 0° 


tSee Appendix B for details. 
tFlip it, rotate it, turn it over, spin it, do whatever you want, as long as you don’t bend, break, or 
distort it. 
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= rotation of 90° 


4 
Ail 
SS 
1 3 
2 


n = rotation of 180° 


fsa 


= rotation of 270° 


d = reflection in the x-axis 
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t = reflection in the y-axis 


4 
3 1 


= reflection in line y = x 


If you perform one of these motions and follow it by another, the result will be 
one of the eight listed above; for example, 
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Tf you think of a motion as a function from the square to itself, then the idea of fol- 
lowing one motion by another is just composition of functions, in the illustration 
above (h followed by r, is £), we can write r, ° k = t (remember r; ° kh means first apply 
h, then apply r,). Verify that the set 


D, = {ro, ri, rz r3, h, v, d, t} 


equipped with the composition operation has this table: 


Clearly D; is closed under °, and composition of functions is known to be associa- 
tive. The table shows that ro is the identity element and that every element of D, has 
an inverse. For instance, r; ° r} = ro = r) ° r3 .Therefore, D, isa group. It is not abelian 
because, for example, ho d # do h. Dy, is called the dihedral group of degree 4 or the 
group of symmetries of the square. 


EXAMPLE 6 


The group of symmetries of the square is just one of many symmetry groups, An 
analogous procedure can be carried out with any regular polygon of n sides. The 
resulting group D, is called the dihedral group of degree n. The group Ds, for ex- 
ample, consists of the six symmetries of an equilateral triangle (counterclockwise 
rotations about the center of 0°, 120°, and 240°; and the three reflections shown 
here and on the next page), with composition of functions as the operation: 


3 3 3 2 
" A 
W oo, 
1 1 1 3 
2 2 2 i 
3 
s 
ae 
I 1 
2 
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Symmetry groups arise frequently in art, architecture, and science. Crystallography 
and crystal physics use groups of symmetries of various three-dimensional shapes. 
The first accurate model of DNA (which led to the Nobel Prize for its creators) could 
not have been constructed without a recognition of the symmetry of the DNA mol- 
ecule. Symmetry groups have been used by physicists to predict the existence of certain 
elementary particles that were later found experimentally. 


Systems with Two Operations 


We now examine some familiar systems with two operations to see what groups arise 
when only one of the operations is considered. 


EXAMPLE 7 


We now show that each of the following is an abelian group under addition, 
that is, with the operation « in the definition of a group being +: 


Z the integers; Z, the integers mod r; 
Q the rational numbers; R the real numbers; C the complex numbers. 


That each system is closed under addition is a fact from basic arithmetic 
(Axiom 1). Likewise, addition in each of these systems is associative: For any 
three numbers a, b, c, 


at+(b+c=(atbte [Additive form of Axiom 2] 
In each system, the identity element is 0 because 
a+0=a=0+a [Additive form of Axiom 3] 
Similarly, the inverse of a is —a because 
a+(-a)=0 and -a+a=0 [Additive form of Axiom 4] 
Finally, each group is abelian because for any two numbers a and b, 
at+b=bta [Additive form of Axiom 5] 


Hereafter, when we use the word “group” without any qualification in refer- 
ring to Z, Z,, Q, R, or C, it is understood that the operation is addition. When 
it comes to multiplication, we have this basic fact: 


None of Z, Z,, Q, R, or C is a group under multiplication. 
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To be sure, each has 1 as its multiplicative identity element, but unfortunately 
0 has no inverse-—the equation 0x = | has no solutions—so Axiom 4 fails. 
Nevertheless, certain subsets of these systems may be groups under multiplication. 


EXAMPLE 8 


Each of the following is an abelian group under multiplication: 
@* the nonzero rational numbers; R* the nonzero real numbers; 


€* the nonzero complex numbers. 


Each system is closed under multiplication because the product of nonzero num- 
bers is nonzero (Axiom 1). Basic arithmetic tells us that multiplication is associa- 
tive and commutative (Axioms 2 and 5). The identity element in each system is 1 
because a+ 1 = a = 1 + a (Axiom 3). The inverse of a is 1/a (Axiom 4). 


EXAMPLE 9 


Let p be a prime, and consider the nonzero elements of Z, under multiplica- 
tion. If a # 0 and b # 0, then ab + 0 by condition (3) of Theorem 2.8, so 
closure holds (Axiom 1). The identity element is 1 (Axiom 3) and inverses exist 
by condition (2) of Theorem 2.8 (Axiom 4). Multiplication is associative and 
commutative by Theorem 2.7 (Axioms 2 and 5). So the nonzero elements of Z, 
form an abelian group under multiplication. 


EXAMPLE 10 


Each of 
Q” the positive rational numbers and R™ the positive real numbers 


is an abelian group under multiplication. Both systems are closed under multi- 
plication since the product of positive numbers is positive. The identity element 
is 1 and the inverse of ais 1/a. 


EXAMPLE 11 


The subset L = {1,—1, 4, ~i) of the complex numbers forms an abelian group 
under multiplication. You can easily verify that closure holds and that 1 is the 
identity element. Since i(i) = — = +—1) = 1, we see that i and -i are inverses 
of each other; —1 is its own inverse since (-1)(-1) = 1. Hence, Axiom 4 holds. 


EXAMPLE 12 


Neither the nonzero integers nor the positive integers form a group under multiplica- 
tion. Although | is the multiplicative identity for each system, no integers except for 
+1 have a multiplicative inverse, so Axiom 4 fails. For example, the equation 2x = 1 
has no integer solution, so 2 has no inverse under multiplication in the integers. 
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EXAMPLE 13 


When » is composite, the nonzero elements of Z, do not form a group under 
multiplication because (among other things) closure fails. In Z6, for instance, 
2-3 = 0 and in Zp, 4 ° 5 = 0. Similarly if n = rs, then in Z,, rs = 0. 


EXAMPLE 14 


Let U, be the set of units in 7,.* By Exercise 17 of Section 2.3, the product of 
two units is a unit, so U, is closed under multiplication (which is known to be 
associative and commutative). The identity 1 is a unit since 1 - 1 = 1. So U, 

is an abelian group under multiplication. By Theorem 2.10, U, consists of all 
acZ,, such that (a, n) = 1 (when a is considered as an ordinary integer). Thus, 
the group of units in 7, is Us = {1, 3, 5, 7}, and the group of units in Z}; is 
Uis = {1, 2, 4, 7, 8, 11, 13, 14}. Here is the multiplication table for U3: 


1 
1 
3 
5 
7 


N U U = 

U N e Wlw 
WU = N aju 
=. wv v NiN 


The next example involves matrices.t A 2 X 2 matrix over the real numbers, is an 
array of the form 


D Pe where a, b, c, d are real numbers. 


Two matrices are equal provided that the entries in corresponding positions are 
equals, that is, 


a b Py xX 3 } 
» Ae Å if and only if a=r,b=s,e=t,d=u. 


(5 =G a ™ G eG 2) 


Matrix multiplication is defined by 
a b\fw _ fawtby ax+bz 
c d\\y z) \ewtdy ex+dz) 


*Recall that an element ain Z, is a unit if the equation ax = 1 has a solution (that is, if a has an inverse 
under multiplication). 
tif you have taken a course in linear algebra, you can skip this paragraph. 


For example, 
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For example, 
i le ek 2*1+3°6 a | 20 11 
o -4f\6 7) \O-1+(-4)6 0(-5)+(-4)7/° \-24 -2y 
Reversing the order of the factors in this product produces 
1 = 2 ay y *2+(-5)0 Tis a 3 ( 2 23 
6 TINO = 6°2+7°0 6-3 + 7(—4) 12 -10/ 


So matrix multiplication is not commutative. A straightforward (but tedious) compu- 
tation shows that matrix multiplication is associative. It’s easy to verify that 


Ge dC dC OG 1) 

0 1/\e d c d) \e go iy 
1 0\. P ; 

Hence, 01 is the identity element. 


EXAMPLE 15 


We shall show that the set of matrices 
a b 
{(2 2) | where ab e deR andad ~ be + 0} 


is a group under multiplication, called the general linear group of degree 2 over R 
and denoted GL(2, R). The discussion before the example shows that GL(2, R) 
has associative multiplication and an identity element (Axioms 2 and 3). You can 
readily verify that when ad — be # 0, 


d —b d —b 


$ b\| ad-—be ad- bc 1 3 ad- be ad-be\fa b 1 n 
= and = 4 
c d “¢ a 0 1 —e a w d 0 1 


ad—be ad—be ad—be ad—be 


So every matrix in GL(2, R) has an inverse (Axiom 4). 
To finish the proof, we need only show that GL(2, R) is closed under multiplication 


(Axiom 1). Suppose that is “and H a are in GL(2, R), so thatad — be #0 
and wz — xy # 0, and hence, (ad — bce)(wz — xy) + 0. To prove that 


é ale x a oe) 
c dj\y z cw + dy cx + dz 
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is in GL(2, R), we must prove that (aw + by\(ex + dz) — (ax + bz)(cw + dy) + 0. 
Verify that 


(aw + by)(cx + dz) — (ax + bz)(cw + dy) = (ad — be)(wz — xy) #0. 


So the product matrix is in GL(2, R). Therefore, GL(2, R) is closed under multi- 
plication and is a group, which is nonabelian (Exercise 7). 


The discussion preceding Example 15 carries over to matrices whose entries are in 
systems other than the real numbers, such as Q, C, and Z, (with p prime). 


EXAMPLE 16 


We shall show that 
a b 
GL(2, 22) = {(2 A| where a, b, c, dE Z, andad — be + o}, 


the general linear group of degree 2 over Z,, is a group under multiplication. 
Matrix multiplication is associative, and the identity matrix is obviously in 
GL(2, Z2). The proof that GL(2, Z2) is closed under multiplication is identical 
to the one for GL(2, R) in Example 15. If A = (: A E GL(2, Z,), thenad — be #0 
in Z,, so ad — be has an inverse by Example 9. Verify that the inverse of A is 


-Ji e = r 
oe d k ine KA ) which is the same inverse matrix given in 


Example 15, with a change of notation: (ad-— bey’ in place of 


1 
ay Hence, 
GL(2, Z,) is a group. It is a finite nonabelian group of order 6 (Exercise 7). 


New Groups from Old 


The Cartesian product G X H of sets Gand H is defined on page 512 of Appendix B. 
Theorem 7.4 on the next page shows that the Cartesian product can be used to pro- 
duce new groups from known ones.* 


*Theorems 7.1-7.3 appear in Section 7.1 and assume that you have read Chapter 3, so they are 
not included in Section 7.1.4. However, many of the preceding examples are special cases of 
these theorems: Example 1 is a special case of Theorem 7.1; Examples 8 and 9 are special cases 
of Theorem 7.2; and Examples 14-16 are special cases of Theorem 7.3. So you haven't missed 
anything crucial for this chapter. You may wish to read Theorems 7.1-7.3 at a later date, after you 
have read Chapter 3. 
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Theorem 7.4 


Let G (with operation +) and H (with operation o) be groups. Define an opera- 
tion® on G X H by 


(g, h)" (g', h') = (g*g',hoh’). 
Then G X H is a group. If G and H are abelian, then so is G X H. f G and H 
are finite, then so is G X H and |6 x H| = |G||H} 


Proof» Exercise 26. m 


EXAMPLE 17 


Both Z and Z, are groups under addition. In Z X Z; we have (3, 5) " (7, 4) = 
(3 + 7,5 + 4) = (10, 3). The identity is (0, 0), and the inverse of (7, 4) is ( —7, 2). 


EXAMPLE 18 


Consider R* X D,, where R* is the multiplicative group of nonzero real num- 
bers. The table in Example 5 shows that 


(2, r1)" (9, v) = (2° 9, ri ° v) = (18, d). 
The identity element is (1, rp), and the inverse of (8, r4) is (1/8, r). 


@ Exercises 


The exercises for this section are the same as those for Section 7.1—see page 180. 


72 Basic Properties of Groups 


Before exploring the deeper concepts of group theory, we must develop some additional 
terminology and establish some elementary facts. We begin with a change in notation. 

Now that you are comfortable with groups, we can switch to the standard multi- 
plicative notation. Instead of a + b, we shall write ab when discussing abstract groups. 
However, particular groups in which the operation is addition (such as Z) will still be 
written additively. 

Although we have spoken of the inverse of an element or the identity element of a 
group, the definition of a group says nothing about inverses or identities being unique. 
Our first theorem settles the question, however. 


Theorem 7.5 


Let G be a group and let a, b, ee G. Then 


(1) G has a unique identity element. 
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(2) Cancelation holds in G: 
lf ab = ac, then b = c; if ba = ca, then b =c. 
(3) Each element of G has a unique inverse. 


Proof» (1) The group G has at least one identity by the definition of a group. If 
e and e’ are each identity elements of G, then 


ee’ =e [Because e' is an identity element.] 
ee' =e’ [Because e is an identity element.] 
Therefore, 
e= =e’, 
so that there is exactly one identity element. 
(2) By the definition of a group, the element a has at least one inverse 


dsuch that da = e = ad. If ab = ac, then d(ab) = d(ac). By associativity 
and the properties of inverses and identities, 


(da)b = (da)c 
eb = ec 
b=e 


The second statement is proved similarly. 
(3) Suppose that d and d’ are both inverses of a € G. Then ad = e = ad’, 
so that d= d’ by (2). Therefore a has exactly one inverse. W 


Hereafter the unique inverse of an element a in a group will be denoted a~!. The 
uniqueness of a~ means that 


whenever ay = e = ya, then y = a™!. 


Corollary 7.6 


lf G isa group and a, beG, then 
(1) (ab) = ba; 
(2) fat E 


Note the order of the elements in statement (1). A common mistake is to write the 


inverse of ab as a~'5—', which may not be true in nonabelian groups. See Exercise 2 
for an example. 


Proof of Corollary 7.6> (1) We have 


(ab)\(b-'a-") = a(bb™ Ja“ = aea™! = aa! = e 


and, similarly, (b-'a~')(ab) = e. Since the inverse of ab is unique by 

Theorem 7.5, 5-'a~! must be this inverse, that is, (ab)! = b'a! 
(2) By definition, a~'!a = e and (a~“')(a™!) = e, so that 

aa = a-Ya-"')". Canceling a~" by Theorem 7.5 shows that 

a=(a'y". B 
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Let G be a group and let a€ G. We define a* = aa, a’ = aaa, and for any positive 
integer n, 


a" =aaa+--a (n factors). 
We also define a? = e and 
eS es E =I 


=a aoa ea (n factors). 


These definitions are obviously motivated by the usual exponent notation in R and 
other familiar rings. But be careful in the nonabelian case when, for instance, (aby" may 
not be equal to a"b”. Some exponent rules, however, do holdin groups: 


Theorem 7.7 


Let G be a group and let a eG. Then for all m, n in Z, 
a™a" = q™tn and (ary =a™, 


> The proof consists of a verification of each statement 1n each possible 

Proof» The proof consists of a verification of each in each possib] 
case (m = 0,n = 0; m = 0, n < 0; etc.) and is left to the reader 
(Exercise 21). m 


NOTE ON ADDITIVE NOTATION: To avoid confusion, the operation in cer- 
tain groups must be written as addition (for example, the additive group of real 
numbers since multiplication there has a completely different meaning). Here is 
a dictionary for translating multiplicative statements into additive ones: 


Multiplicative Additive 
Notation Notation 
Operation: ab a+b 
Identity: 0 
Inverse: a =a 
Exponents: dď' = aa» » » a (n factors) na=a+a+ ... +a(nsummands) 
a*=q...q' (-—n)a= -a-a-..--a 
Theorem 7.7: d"a" = a"*" (ma) + (na) = (m + nja 
(a) =a™ n(ma) = (mn)a 


Order of an Element 


We return now to multiplicative notation for abstract groups. An element a in a group 
is said to have finite order if a* = e for some positive integer k.* In this case, the order 
of the element a is the smallest positive integer n such that a" = e. The order of a is 


*In additive notation, the condition is ka = 0. 
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denoted |a|. An element a is said to have infinite order if a* # e for every positive 
integer k. 


EXAMPLE 1 


In the multiplicative group of nonzero real numbers, 2 has infinite order 
because 2* # 1 for all k = 1. In the group L = {+1, + i} under multiplication 
of complex numbers, the order of i is 4 because 7? = —1, È = —i, and i* = 1. 


Similarly, |-i] = 4. The element fs 3 >) in S} has order 3 because 


€ 2 iy z [s 2, 3 ana l 2 ay A 23 
S2 2; 3 i 2 1 2 1 2 ay 
The identity element in a group has order 1. 


EXAMPLE 2 


In the additive group 7,9, the element 8 has order 3 because 8 + 8 = 4 and 
8+8+8=0. 


In the multiplicative group of nonzero real numbers, the element 2 has infinite 
order and all the powers of 2 (2%, 2°, 2°, etc.) are distinct. On the other hand, in the 
multiplicative group L = {+1, +i}, the element i has order 4 and its powers are not 
distinct; for instance, 


#=1= ad M= HP? = 7, 


Observe that i!° = i? and 10 = 2 (mod 4). These examples are illustrations of 


Theorem 7.8 


Let G be a group and letaeG. 


(1) a has infinite order, then the elements a“, with k EZ, are all distinct. 
(2) Ha! = a with# # j, then a has finite order. 


Proof > Note first that statement (1) is true if and only if statement (2) is true, 
because each statement is the contrapositive of the other, as explained 
on pages 503-504 of Appendix A. So we need only prove one of them. 
We shall prove statement (2): 

Suppose that a! = a’, with i > j. Then multiplying both sides by a’ 
shows that a’ / = a! = a = e. Since i — j > 0, this says that a has finite 
order. W 
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Theorem 7.9 


Let G be a group and a € G an element of finite order n. Then: 


(1) a& = eif and only ifn |k; 
(2) a! = a! if and only if ì = j (mod n); 
(3) If n = td, with d = 1, then af has order d. 


Proof » (1) If n divides k, say k = nt, then at = d" = (a" = £ = e. Conversely, 
suppose that a* = e. By the Division Algorithm, k = ng + r with 
0 = r < n. Consequently, 


e = aœ = dY = aq’ = (ad = ea’ = ea" =a. 


By the definition of order, n is the smallest positive integer with a" = e. 
Since r< n, @ = e can occur only whenr = 0. Thus, k = nq + 0 and n 
divides k. 

(2) First, note that a' = a/ if and only if a’? = e. [Proof if af = a’, 
then a'/ = e by the proof of Theorem 7.8(2). Conversely, if a’! = e, 
then multiplying both sides by a/ shows that a’ = a/.] But by (1), with 
k = i — j, we havea’! = e if and only if n | (i — J), that is, if and only 
if i = J (mod a). Therefore, a' = æ if and only if i= jJ (moda). 

(3) Since |a| = n, we have (a)? = a” = a" = e. We must show that d is the 
smallest positive integer with this property. If k is any positive integer such 
that (a’* = e, then a* = e. Therefore, n | tk by part (1), say tk = nr = (td)r. 
Hence, k = dr. Since k and dare positive and d | k, wehaved=k. E 


Corollary 7.10 


Let G be an abelian group in which every element has finite order. If ce G is 
an element of largest order in G (that is, |a| = |¢| for all a € G), then the order 
of every element of G divides |c|. m 


For example, (1, 0) has order 4 in the additive abelian group Z, X Z, and every other 
element has order 1, 2, or 4 (Exercise 10(b)). Thus (1, 0) is an element of largest possible 
order, and the order of every element of the group divides 4, the order of (1, 0). 


Proof of Corollary 110» Suppose, on the contrary, that a € G and |a| does not 
divide |c|. Then there must be a prime p in the prime factorization of the 
integer |a| that appears to a higher power than it does in the prime fac- 
torization of |c|. By prime factorization we can write |a| as the product 
of a power of p and an integer that is not divisible by p and similarly for 
c. Thus there are integers m, n, r, s such that |a| = p’m and |c| = p’n, with 
(p, m) = 1 = (p, n) andr > s. By part (3) of Theorem 7.9, the element a” 
has order p" and ec?" has order n. Exercise 33 shows that ac?" has order 
p'n. Hence, |a"c?'| = p’n > p'n = |c|, contradicting the fact that ¢ is an 
element of largest order. Therefore, |a| divides |c]. I 
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E Exercises 


NOTE: Unless stated otherwise, G is a group with identity element e. 
A. 1. If c? = cin a group, prove that ¢ = e. 
2. Leta = K : >) and b = G e >) in S}. Verify that (ab)! + att. 
3. If a, b, ¢, dE G, then (abed) =? 
4. If a, be Gand ab = e, prove that ba = e. 
5. Let £G — G be given by fla) = a". Prove that f is a bijection. 
6. Givean example of a group in which theequation x? = e has more than two solutions. 
7. Find the order of the given element. 
(a) 5in Ug 


1234567). 
©) im 751 OLG 


© j a in GL(2, R) 


> oe 
@) ( ; i) in GL(2, R) 
32 92 
8. Give an example of a group that contains nonidentity elements of finite order 
and of infinite order. 
9. (a) Find the order of the groups Uio, Uj2, and Uy. 
(b) List the order of each element of the group Uy. 
10. Find the order of every element in each group: 
a) OXA OAS Gh OZ 


11. Let G be an additive group. Write statement (2) of Theorem 7.8 and statements 
(1}+(3) of Theorem 7.9 in additive notation. 


12. If a, be Gand n is any integer, show that (aba Y = ab'a™. 


13. If Gis a finite group of order n and a E G, prove that |a| = n. [Hint: Consider the 
n + 1 elements e = æ a, a’, a’, . . . ,a". Are they all distinct?) Thus every element 
in a finite group has finite order. The converse, however, is false; see Exercise 25 
in Section 8.3 for an infinite group in which every element has finite order. 


14. True or false: A group of order n contains an element of order n. Justify your 
answer. 


15. (a) If a€ Gand a’? = e, what order can a possibly have? 
(b) If e + be Gand b? = e for some prime p, what is |b|? 
16. (a) If a€ Gand |al = 12, find the orders of each of the elements a, a?, œ, . . . , a". 


(b) Based on the evidence in part (a), make a conjecture about the order of a 
when |a| = n. 
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17. 


18. 


19. 


32. 


33. 


34. 


Groups 


(a) Let æ, b E G. Prove that the equations ax = b and ya = b each have a 
unique solution in G. [Hint: Two things must be done for each equation: 
First find a solution and then show that it is the only solution.] 


(b) Show by example that the solution of ax = b may not be the same as the 
solution of ya = b. [Hint: Consider S3.] 


Let G = {a,, a, ..-., Ap} be a finite abelian group of order n. Let x = ajaz -+ ap 
Prove that x* = e. 


If a, bE G, prove that |bab™| = al. 


1 0 


. (a) Show that a = ( a d has order 3 in G2, Ranab=( H 


hasorder4, \~! —! 
(b) Show that ab has infinite order. 


. Prove Theorem 7.7. 
. Let G = {e, a,b} be a group of order 3. Write out the operation table for G. 


[Hint: Exercise 28 in Section 7.1] 


. Let G be a group with this property: If a, b, ce G and ab = ca, then b =e. 


Prove that G is abelian. 


. If (ab)? = ab? for all a, b, E G, prove that G is abelian. 
. Prove that G is abelian if and only if (aby ! = a'b! for alla, bE G. 


. Prove that every nonabelian group G has order at least 6; hence, every group 


of order 2, 3, 4, or 5 is abelian. [Hint: If a, be Gand ab + ba, show that the 
elements of the subset H = {e, a, b, ab, ba} are all distinct. Show that either 
@ ¢ H or @ = e in the latter case, verify that aba ¢ H.] 


. If every nonidentity element of G has order 2, prove that G is abelian. 


[Hint: |a| = 2 if and only if a + e anda =a". Why?) 


. If aeG, prove that [al = a`]. 

. If a, b, c E G, prove that there is a unique element x € G such that axb = e. 
. If a, b E G, prove that jab] = [ba]. 

. (a) If a, be Gand ab = ba, prove that (ab)4"l = e. 


(b) Show that part (a) may be false if ab + ba. 


If |G] is even, prove that G contains an element of order 2. [Hint: The identity 
element is its own inverse. See the hint for Exercise 27] 


Assume that a, b € Gand ab = ba. If |a| and |b] are relatively prime, prove that 
ab has order |al|b|. [Hint: See Exercise 31.) 


Suppose G has order 4, but contains no element of order 4. 


(a) Prove that no element of G has order 3. [Hint: If |g| = 3, then G consists 
of four distinct elements g, 8°, g = e, d. Now gd must be one of these four 
elements. Show that each possibility leads to a contradiction.] 


(b) Explain why every nonidentity element of G has order 2. 
(c) Denote theelements of G by e, a, b, c and write out the operation table for G. 
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35. If a,b € G, bô = e, and ab = ba, prove that b? = e and ab = ba. 
36. Suppose a, be G with |a| = 5, b + e, and aba! = FF. Find |b]. 
37. If (ab)? = œb’ and (ab)? = a’ for all a, b € G, prove that Gis abelian. 


C. 38. If (ab)! = a’b’ for three consecutive integers i and all a, bE G, prove that G is 
abelian. 


39. (a) Let G be a nonempty finite set equipped with an associative operation 
such that for all a, b, c, dE G: 


if ab = ac, then b = c and if bd = ed, then b = c. 
Prove that G is a group. 
(b) Show that part (a) may be false if G is infinite. 


40. Let G be a nonempty set equipped with an associative operation with these 
properties: 


(i) There is an element e E G such that ea = a for every a E G. 
(ii) For each aE G, there exists dE G such that da = e. 
Prove that Gis a group. 


41. Let Gbe a nonempty set equipped with an associative operation such that, 
for all a, b EG, the equations ax = b and ya = b have solutions. Prove that G 
is a group. 


73 Subgroups 


We continue our discussion of the basic properties of groups, with special attention 
to subgroups. 


Definition A subset H of a group G is a subgroup of G if H is itself a group under the 
operation in G. 


Every group G has two subgroups: G itself and the one-element group {e}, which is 
called the trivial subgroup. All other subgroups are said to be proper subgroups. 


EXAMPLE 1 


The set R* of nonzero real numbers is a group under multiplication. The group 
R** of positive real numbers is a proper subgroup of R*. 


EXAMPLE 2 


The set Z of integers is a group under addition and is a subgroup of the additive 
group Q of rational numbers. 
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EXAMPLE 3 


The subset L = {1, —1, ¿ —i} of the complex numbers is a group under multipli- 
cation.* So it is asubgroup of C*, the multiplicative group of nonzero complex 
numbers. 


EXAMPLE 4 


Recall that the multiplicative group of units in Zg is Ug = {1, 3, 5, 7}. The 
upper-left quarter of its operation table in Example 14 of Section 7.1 or 
Section 7.1.A shows that the subset {1, 3} is a subgroup of U,. 


EXAMPLE 5 


The upper-left quarter of the operation table for D} in Example 5 of Section 7.1 
or 7.1.A shows that H = {ro ry, rz, r3} is a subgroup of D,. 


EXAMPLE 6 


In the additive group 7, X #4, let H = {(0, 0), (3, 0), (0, 2), (3, 2)}. Verify that 
H is a subgroup by writing out its addition table. 


When proving that a subset of a group is a subgroup, it is never necessary to check asso- 
Ciatavity. Since the associative law holds for ail elements of the group, itautomatically holds 
when the elements are in some subset H. In fact, you need only verify two group axioms: 


Theorem 7.11 


A nonempty subset H of a group G is a subgroup of G provided that 


(i) ifa, bE H, then ab EH; and 
(ii) if aeH, then a™'eH. 
Proof » Properties (i) and (ii) are the closure and inverse axioms for a group. 
Associativity holds in H, as noted above. Thus we need only verify that 


eE H. Since H is nonempty, there exists an element c € H. By (ii), £} € H, 
and by (i) ce"! = e is in H. Therefore H is a group. E 


EXAMPLE 7 


Let H consist of all 2 X 2 matrices of the form b = f 3 with bE R. Since 
1-1-—5:-:0=1, H is a nonempty subset of the group GL(2, R), which was 


*See Example 11 of Section 7.1 or Section 7.1.A. 
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defined in Example 15 of Section 7.1 or 7.1.A. The product of two matrices in 


His in H because 
(o s) (o °) = G a+ ‘) 
0 i7\0 1 0 1 


The inverse of G yi is fs pt" which is also in H. Therefore, H is a 


subgroup of GZ(2, R) by Theorem 7.11. 


When H is finite, just one axiom is sufficient to guarantee that H is a subgroup. 


Theorem 7.12 


Let H be a nonempty finite subset of a group G. If H is closed under the opera- 
tion in G, then H is a subgroup of G. 


Proof > By Theorem 7.11, we need only verify that the inverse of each element 
of His also in H. If a€ H, then closure implies that a € H for every 
positive integer k, Since H is finite, these powers cannot all be distinct. 
So a has finite order n by Theorem 7.8 and a" = e. Since n — 1 = —1 
(mod n), we have a" ~ ! = a`! by Theorem 7.9. If n > 1, then n — 1 is 
positive and a! = a” 7! is in H. If n = 1, then a = e and a`! =e = a, so 
thata`tisin H. m 


EXAMPLE 8 


Let H consist of all permutations in S; that fix the element 1. In other words, 
= { fE S; |1) = 1}. H is a finite set since S; is a finite group. If g, kE H, 

then g(l) = 1 and A(1) = 1. Hence, (g ° AX( 1) = g(h(1)) = afl) = 1. Thus 

g° keH and H is closed. Therefore, H is a subgroup of S; by Theorem 7.12. 


The Center of a Group 
If Gisa group, then the center of Gis the subset denoted Z(G) and defined by 


Z(G) = {ae G| ag = ga for every g E G}. 


In other words, an element of G is in Z(G) if and only if it commutes with every 
element of G. If Gis an abelian group, then Z(G) = G because all elements commute 
with each other. When G is nonabelian, however, Z(G) is not all of G 


EXAMPLE 9 


The center of S, consists of the identity element alone because this is the only 
element that commutes with every element of S, (Exercise 25). 
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EXAMPLE 10 


The operation table for D, in Example 5 of Section 7.1 or 7.1.A shows that rı 
commutes with some elements of D, (for instance, r} ° r3 = r3 ° r1). However, 
it does not commute with every element of D, because r; ° d # do r,. Hence, 
rı is not in Z(D,) nor is d. Careful examination of the table shows that 

Z(D4) = {ro, rz} since these are the only elements that commute with every 
element of D4 It is easy to verify that {r9, r2} is a subgcoup of D,. This is an 
example of the following result. 


Theorem 7.13 


The center Z(G) of a group G is a subgroup of G. 


Proof > For every g E G, we have eg = g = ge. Hence, e € Z(G) and Z(G) is non- 
empty. If a, bE Z(G), then for any g E G we have ag = ga and bg = gb, 
so that 

(ab)g = a(bg) = a(gb) = (ag)b = (ga)b = g(ab). 


Therefore, ab e Z(G). Finally, if ac Z(G) and g E G, then ag = ga. 
Multiplying both sides of this equation on the left and right by a`! 
shows that 


a (aga! = a (gaya 


ga! =a'lg 
Therefore, a`’ € Z(G) and Z(G) is a subgroup by Theorem 7.11. E 


Cyclic Groups 
An important type of subgroup can be constructed as follows If G is a group and 
aE G, let (2) denote the set of all powers of a: 


(d) = {p a, a? a4, dal, a,...} = {a |neZ}. 


Theorem 7.14 


If G isa group and a €G, then (a) = {a" |n € Z} is a subgroup of G. 


Proof » The product of any two elements of (a) is also in (a) because a'a’ = a'¥/, 
The inverse of a* is a~*, which is also in (a). By Theorem 7.11, (a}is a 
subgroupof G. E 


The group (@) is called the cyclic subgroup generated by a. If the subgroup (a) is the 
entire group G, we say that Gis a cyclic group. Note that every cyclic group is abelian 
since da! = a’! = qld’. 
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EXAMPLE 11 


The multiplicative group of units in the ring Z,, is U,; = {1, 2, 4, 7, 8, 11, 13, 14} 
by Theorem 2.10. In order to determine the cyclic subgroup generated by 7, we 
compute 


J=7 P=4 Pols PH=1=7. 
Therefore, the element 7 has order 4 in Ujs. We claim that the cyclic subgroup 
(7) consists of {7°, 7', 77, 733 = {1, 7, 4, 13}. [Proof: By definition, every ele- 
ment of (7) is of the form 7' for some integer i. Since every integer is congruent 
modulo 4 to one of 0, 1, 2, 3, the element 7' must be one of 7°, 7!, 7? or 7° by 
Theorem 7.9(2).] Hence, (7) = {1, 7, 4, 13}. Thus, the cyclic subgroup (7) has 
order 4—the order of the element 7 that generates the group. 


EXAMPLE 12 


Different elements of a group may generate the same cyclic subgroup. For 
instance, verify that 13 has order 4 in U,s. Then the same argument used in 
Example 11 shows that the cyclic subgroup (13) = {13° 13', 13%, 1333 = 

{1, 13, 4, 7} = (7). 


The argument used in Examples 11 and 12 works in general and provides the con- 
nection between the two uses of the word “order”. It states, in effect, that the order of 
an element a is the same as the order of the cyclic subgroup generated by a. 


Theorem 7.15 


Let G be a group and let aeG. 


(1) If a has infinite order, then @) is an infinite subgroup consisting of 
the distinct elements a*, with ke Z. 

(2) If a has finite order n, then (a} is a subgroup of order n and {a} = 
fe= a, a',a’,a*,...,a°~, 


Proof > (1) This is an immediate consequence of part (1) of Theorem 7.8. 

(2) Let a’ be any element of (a). Then i is congruent modulo n to one 
of 0, 1, 2,...,n ~ 1. Consequently, by part (2) of Theorem 7.9, a' must be 
equal to one of a’, a', a’, ..., ad~". Furthermore, no two of these powers 
of a are equal since no two of the integers 0, 1, 2, ..., n — 1 are congruent 
modulo n. Therefore, (2) = {a°, a',a’,..., a*-"} isa group of ordern. W 


NOTE ON ADDITIVE NOTATION: When the group operation is addi- 
tion, then, as shown in the dictionary on page 198, we write ka in place 


of a*. So the cyclic subgroup (a) = {na | neZ}. Theorem 7.15 in additive 
notation is shown on the next page. 
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Theorem 7.15 (Additive Version) 


Let G be an additive group and letae G. 


(1) If a has infinite order, then @) is an infinite subgroup consisting of 
the distinct elements ka, with kez. 


(2) If a has finite order n, then (a) is a subgroup of order n and 


(a) = {0, 1a, 2a, 3a, 4a, aang (n iiad 1)a}. 


EXAMPLE 13 


Since Z = {n1 | n EZ}, we see that the additive group Z is an infinite cyclic 
group with generator 1, that is Z = (1). The set E of even integers is a cyclic 
subgroup of the additive group Z because E = {n2 |n EZ}. 


EXAMPLE 14 


Each of the additive groups 7, is a cyclic group of order n generated by 1 because 
Z, consists of the “powers” of 1, namely, 1,2 =1 + 1,3 =1 +1 + 1, etc. For 
instance, 7, = {1, 2, 3, 0}, that is, {1,1+1,1+1+1,1+1+1+41}. 


The subgroup {1, —1, i, —i} of the multiplicative group of nonzero elements of C 
is the cyclic subgroup (f because i? = —1, i? = —i, and i‘ = 1. Similarly, the multipli- 
cative group of nonzero elements of Z, is the cyclic group (3), as you can easily verify. 
These examples are special cases of the following theorem. 


Theorem 7.16 


Let F be any one of Q, R, C, or Z, (with p prime), and let F° be the multiplica- 
tive group of nonzero elements of F.* If G is a finite subgroup of F°, then G is 
cyclic.? 


Proof > Let ce G be an element of largest order (there must be one since G is 
finite), say |c| = m. If a€ G, then |a| divides m by Corollary 7.10, so that 
a” = 1 by part (1) of Theorem 7.9. Thus every element of Gis a solu- 
tion of the equation x” — 1 = 0. Since a polynomial equation of degree 
m has at most m solutions in F (by Corollary 4.174), we must have |G] = m. 
But (c} is a subgroup of G of order m by Theorem 7.15. Therefore, (c) 
must be all of G, that is, Gis cyclic. E 


tSee Examples 8 and 9 of Section 7.1 or 7.1. A. 
+For those who have read Chapter 3: The theorem and its proof are valid when F is any field. 
Sif you haven't read Section 4.4, you'll have to take this on faith for now. 
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Now that we know what cyclic groups look like, the next step is to examine the 
possible subgroups of a cyclic group. 


Theorem 7.17 


Every subgroup of a cyclic group is itself cyclic. 


Proof > Suppose G = (a)and H is a subgroup of G. If H = (8 , then H is the 
cyclic subgroup generated by e (all of whose powers are just e). If H # 
(e), then H contains a nonidentity element of G, say a’ with i # 0. Since 
H is a subgroup, the inverse element a™ is also in H. One of i or —i is 
positive, and so H contains positive powers of a. Let k be the smallest 
positive integer such that a* € H. We claim that H is the cyclic subgroup 
generated by a*. To prove this, we must show that every element of H 
is a power of a*. If he H, then h € G, so that h = a” for some m. By the 
Division Algorithm, m = kq + r with0 = r < k. Consequently, r = m — kq 
and 


a’ = a" = a"a™ = g*(a*)-4. 


Both a" and a“ are in H. Therefore, a’ €H by closure. Since a is the 
smallest positive power of ain H and since r < k, we must have r = 0. 
Therefore, m = kq and h = a™ = a*l = (a*} E(a*). Hence, H = (4). m 


For additional information on the structure of cyclic groups and their subgroups, 
see Exercises 4446. 


Generators of a Group 


Suppose G is a group and a E G. Think of the cyclic subgroup (a) as being constructed 
from the one-element set S = {a} in this way: Form all possible products of a and a“ 
in every possible order. Of course, each such product reduces to a single element of 
the form a”. We want to generalize this procedure by beginning with a set S that may 
contain more than one element. 


Theorem 7.18 


Let S be a nonempty subset of a group G. Let (S) be the set of all possible 
products, in every order, of elements of S and their inverses.* Then 


(1) (S) is a subgroup of G that contains set S. 
(2) tf H is a subgroup of G that contains the set S, then H contains the 
entire subgroup (S). 


*We allow the possibility of a product with one element so that elements of S will be in {S}. 
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This theorem shows that (S) is the smallest subgroup of G that contains the set S. In 
the special case when S = {a}, the group (S)is just the cyclic subgroup (a), which is the 
smallest subgroup of G that contains a. The group (S) is called the subgroup generated 
by S. If (S) is the entire group G, we say that S generates G and refer to the elements of 
S as the generators of the group. 


Proof of Theorem 7.18 > (1) (S) is nonempty because the set S is nonempty and 
every element of S (considered as a one-element product) is an element of 
(S) . If a, b E(S), then a is of the form aja, - - - ap, where k = 1 and each a; 
is either an element of S or the inverse of an element of S. Similarly, 

b = bby - - - b, with ¢ = 1 and each b, either an element of S or the in- 
verse of an element of S. Therefore, the product ab = ajay - - - agb,by- - - b; 
consists of elements of S or inverses of elements of S. Hence, ab € (S), 
and (S) is closed. The inverse of the element a = aja, - - « a; of (S) is 

a`! = ay! - - ay'a! by Corollary 7.6. Since each a, is either an element 
of Sor the inverse of an element of S, the same is true of ar !. Therefore, 
a™ e (S). Hence, (S) is a subgroup of G by Theorem 7.11. 


(2) Any subgroup that contains the set S must include the inverse 
of every element of S. By closure, this subgroup must also contain all 
possible products, in every order, of elements of S and their inverses. 
Therefore, every subgroup that contains S must also contain the entire 


group (S). E 
EXAMPLE 15 


The group Uj; = {1, 2, 4, 7, 8, 11, 13, 14} is generated by the set S = {7, 11} 
since 
Paz P= P =13 mat 
va hese PALE Peist: 


Different sets of elements may generate the same group. For instance, you can readily 
verify that U,; is also generated by the set {2, 13} (Exercise 9). 


EXAMPLE 16 
Using the operation table in Example 5 of Section 7.1 or 7.1.A, we see that in 
the group D,, 
G =n (ry =r =n CY = ro 


h =h nokh=t (Peh=v (rYoh=d. 


Therefore, D, is generated by {r,, A}. Note that the representation of group 
elements in terms of the generators is not unique; for instance, 


(Poh=d and hoho} =d. 
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E Exercises 


A. 1. List all the cyclic subgroups of 
(a) Uis (b) Uy 
2. (a) List all the cyclic subgroups of D,. 
(b) List at least one subgroup of D, that is not cyclic. 
3. List the elements of the subgroup (a), of S}, where 


faepe: 
SRR ESLA 


In Exercises 4-8, list (if possible) or describe the elements of the given cyclic subgroup. 

4. (2) in the additive group Z;>. 
5. (2) in the additive group Z. 
6. (2) in the multiplicative group of nonzero elements of Zi. 
7. (2)in the multiplicative group O* of nonzero rational numbers. 
8. (3) in the multiplicative group of nonzero elements of Z,). 
9. Show that U,,is generated by the set {2, 13}. 

10. Show that (1, 0) and (0, 2) generate the additive group Z x Z4. 

11. Show that the additive group Z, X Z; is cyclic. 

12. Show that the additive group Z, X Z, is not cyclic but is generated by two elements. 


13. Let H be a subgroup of a group G. If eg is the identity element of G and ey is 
the identity element of H, prove that eg = ey. 


14. Let H and K be subgroups of a group G. 
(a) Show by example that H U K need not be a subgroup of G. 
(b) Prove that H U Kis a subgroup of Gif and only if H&K or KS H. 


15. (a) Let H and K be subgroups of a group G. Prove that H N K is a subgroup 
of G. 


(b) Let {H;} be any collection of subgroups of G. Prove that N H; is a 
subgroup of G. 


16. Let G, be a subgroup of a group G and H; a subgroup of a group H. Prove 
that G, X H; is a subgroup of G xX H. 


17. Show that the only generators of the additive cyclic group Z are 1 and —1. 
18. Show that (3, 1), (—2, — 1), and (4, 3) generate the additive group Z x Z. 


19. Let G be an abelian group and let T be the set of elements of G with finite 
order. Prove that Tis a subgroup of G; it is called the torsion subgroup. (This 
result may not hold if Gis nonabelian; see Exercise 20 of Section 7.2.) 


20. Let G bean abelian group, x a fixed positive integer, and H = 
{a€ G| |a| divides k}. Prove that H is a subgroup of G. 
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21. 


27. 


29. 


31. 


32. 


33. 


34. 
35. 


36. 


37. 


B. 38. 


Groups 


(a) If G is a group and ab E Z(G), is it true that a and b are in Z(G)? [Hint: D4] 
(b) If Gis a group and ab € Z(G), prove that ab = ba. 


. If ais the only element of order 2 in a group G, prove that aE Z(G). 
. Let G be a group and let a E G. Prove that (a) = (a7). 


. Show that Q**, the multiplicative group of positive rational numbers, is not 


a cyclic group. [Hint: if 1 + r € Q**, then there must bea rational between r 
and r?°] 


. Show that the center of S; is the identity subgroup. 
. (a) Let Hand K be subgroups of an abelian group Gand let HK = {ab|aeH, 


bE K}. Prove that HK is a subgroup of G. 
(b) Show that part (a) may be false if G is not abelian. 


Let H be a subgroup of a group G and, for x € G, let x~!Hx denote the set 
{x7!ax |a € H}. Prove that x-'!Hx is a subgroup of G. 


. Let G be an abelian group and n a fixed positive integer. 


(a) Prove that H = {a€ G| a" = e} is a subgroup of G. 
(b) Show by example that part (a) may be false if G is nonabelian. [Hinz: S3.] 


Prove that a nonempty subset H of a group Gis a subgroup of Gif and only if 
whenever a, be H, thenab~'e H. 


. Let A(T) be the group of permutations of the set T and let 7, be a nonempty 


subset of T. Prove that H = {fe A(T) | f(t) = t for every t E€ T,} is a subgroup 
of A(T). 

Let Tand T, be as in Exercise 30. Prove that K = {fe A(T) |f(1) = Tj} isa 
subgroup of A(T) that contains the subgroup H of Exercise 30. Verify that if 
T, has more than one element, then K # H. 

Let H be a subgroup of a group G and assume that x “'Hx © H for every xe G 
(notation as in Exercise 27). Prove that x -'!Hx = H for each xE G. 

Let G be a group and a E G. The centralizer of a is the set Qa) = {gEG| 

ga = ag}. Prove that C(a) is a subgroup of G. 

If Gis a group, prove that Z(G) = 7, C(a) (notation as in Exercise 33). 


Prove that an element a is in the center of a group Gif and only if C{a) = G 
(notation as in Exercise 33). 

True or false: If every proper subgroup of a group G is cyclic, then G is cyclic. 
Justify your answer. 


Suppose that H is a subgroup of a group Gand that a € G has order n. If a* € H 
and (k, n) = 1, prove that ae H. 
(a) Let p be prime and let b be a nonzero element of Z,. Show that 5? ~ = iL. 
[Hint: Theorem 7.16.] 
(b) Prove Fermat’s Little Theorem: If p is a prime and a is any integer, then 
@ = a (mod p). [Hint: Let b be the congruence class of a in Z, and use 
part (a).] 
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39. 


42. 


43. 
. Let G = {a} be a cyclic group of order n. 


47. 
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ab 
. Prove that H = qe ) 


. Let G bean abelian group and z a fixed positive integer. Prove that H = {a"|aeG} 


Prove that Ii æ 
oO rove al 0 1 


. Prove that Z,, X Z, is cyclic if and only if (m, n) = 1. 
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If His a subgroup of a group G, then the normalizer of H is the set M(H) = 
{x€ G|x Hx = H} (notation as in Exercise 27). Prove that N(H) is a 
subgroup of G that contains H. 


a=lor—l,be z} is asubgroup of GL(2, Q). 


is a subgroup of G. 

Let k be a positive divisor of the positive integer n. Prove that H, = 
{aE U, |æ = | (mod k)} is a subgroup of Up. 

List all the subgroups of Z)2. Do the same for Zy- 


(a) Prove that the cyclic subgroup generated by #” is the same as the cyclic 
subgroup generated by af, where d = (m, n). [Hint: It suffices to show that 
afis a power of a™ and vice versa. (Why?) Note that by Theorem 1.2, there 


are integers u and v such that d = mu + nv] 
(b) Prove that a” is a generator of Gif and only if (m, n) = 1. 


. Let G = {a} be a cyclic group of order n. If His a subgroup of G, show that |] 


is a divisor of n. [Hint: Exercise 44 and Theorem 7.17.] 


. Let G = @) bea cyclic group of order n. If k is a positive divisor of n, prove 


that G has a unique subgroup of order k. [Hint: Consider the subgroup 
generated by a”*] 

Let G be an abelian group of order mn where (m, n) = 1. Assume that G 
contains an element a of order m and an element b of order n. Prove that G is 
cyclic with generator ab. 


. Show that the multiplicative group R* of nonzero real numbers is not cyclic. 


. If Gis an infinite additive cyclic group with generator a. Prove that the 


equation x + x = a has no solution in G. 


. Show that the additive group Q is not cyclic. [Hint: Exercise 49.] 
. Let Gand H be groups. If G X H is acyclic group, prove that Gand H are 


both cyclic. (Exercise 12 shows that the converse is false.) 


ne z} is a cyclic subgroup of GL(2, R). 


. If G # @}is a group that has no proper subgroups, prove that Gis a cyclic 


group of prime order. 


. Is the additive group G = {a + bV2 | a, bE Z} cyclic? 

. Show that the group U» of units in Zy is not cyclic. 

. Show that the group Uj, of units in Z, is cyclic. 

. If Sis anonempty subset of a group G, show that (S) is the intersection of the 


family of all subgroups H such that S & H. 
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7.4 | Isomorphisms and Homomorphisms* 


If you were unfamiliar with roman numerals and came across a discussion of integer 
arithmetic written solely with roman numerals, it might take you some time to realize 
that this arithmetic was essentially the same as the familiar arithmetic in Z except for 
the labels on the elements. Here is a less obvious example of the same situation. 


EXAMPLE 1 


Recall the multiplicative subgroup L = {1, i, -4,—-1} of the complex numbers 
and the multiplicative group Us = {1, 2, 3, 4} of units in Zs, whose operation 
tables are shown below.t 


U; L 
A 2 Ba 1 i -i -1 
LS 2 3 3 1) 1 i -i -l 
ag A 1 2 i ESE T i 
B3 L y 2 -i | -i 1 -1 i 
g T 32 & | -1|-1 -i i 1 


At first glance, these groups don’t seem the same. But we claim that they are 
“essentially the same”, except for the lablels on the elements. To see this clearly, 
relabel the elements of U; according to this scheme: 


Relabel 1 as 1; Relabel 2 as #; Relabel 3 as —é Relabel 4 as —1. 
Now look what happens to the table for U;s—it becomes the table for L! 


The rewritten table shows that the operations in U; and L work in exactly the 
same way—the only difference is the way the elements are labeled. As far as 


*The first few pages of this section explain the concept of isomorphism for groups, which is 
essentially the same as the explanation for rings in Section 3.3. If you have read that section, feel 
free to begin this one at the Definition on page 216. 

tTo make the elements of the two groups easily distinguishable, the elements of Ł are in boldface. 
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group structure goes L; is just the group U, with new labels on the elements. In 
more technical terms, U; and L are said to be isomorphic 


In general, isomorphic groups are groups that have the same structure, in the sense 
that the operation table for one is the operation table of the other with the elements 
suitably relabeled. Although this intuitive idea is adequate for small finite groups, we 
need to develop a rigorous mathematical definition of isomorphism that agrees with 
this intuitive idea and is readily applicable to large groups as well. 

There are two aspects to the intuitive idea that groups G and H are isomorphic: 
telabeling the elements of G, and comparing the new operation table with that of H. 
Relabeling means that every element of G is paired with a unique element of H (its new 
label). In other words, there is a function f:G—H that assigns to each r € G its new label 
S(r) €H. In the preceding example, we used the relabeling function f: U;—>K given by 


Wet fB)=i Jer HO =4 
The function fG—>H must have these properties: 
(1) Distinct elements of G get distinct labels in H: 
If r + r'in G, then f(r) + f(r’) in H. 
(2) Every element of H is the label of some element of G:* 
For each A € H, there is an re G such that f(r) = h. 


Properties (1) and (2) simply say that the function f must be both injective and surjec- 
tive, that is, fis a bijection.* 

In order to be an isomorphism, however, the table of Gmust become the table of H 
when f is applied. If this is the case, then for two elements a and b of G, the situation 
must look like this: 


G H 
* Se) 
a c f(@) fe) 


As indicated in the two tables, 
a»b=cinG and f(a) + fb) = fc)in H 
Since a x b = ¢ in G, we must have f(a « b) = f(e) in H. Combining this with the fact 
that f(c) = f(a) « f(b) in H we see that 
fla + b)=f(a) » fb). 
This is the condition that f must satisfy in order for f to change the operation tables of 
G into those of H. We can now state a formal definition of isomorphism. 


*Otherwise we could not get the complete table of H from that of G. 
tinjective, surjective, and bijective functions are discussed in Appendix B. 
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Definition Let G and H be groups with the group operation denoted by +. & is 
isomorphic to a group H (in symbols, G = H) if there is a function E >H 
such that 


(i) fis injective; 
(ii) fis surjective; 
(iit) f(a =b) = faj» f(b) for alla, be G. 


In this case, the function f is called an isomorphism. 


It can be shown that G = H if and only if H = G (Exercise 53). 


NOTE: In the preceding discussion, we have temporarily reverted to the x 
notation for group operations to remind you that in a specific group, the 
operation might be addition, multiplication, or something else. In such 
cases, condition (iii) of the definition may take a different form; for instance, 


Condition (iii) fia «b) = f(a) « f(b) 


Gand H additive: f(a + b) =f(a) +f) 
G and H multiplicative: flab) = f(af(b) 

G additive, H multiplicative: f(a + b) = f(a)f(b) 

G multiplicative, H additive: (ab) = f(a) + f(b) 


EXAMPLE 2 


The multiplicative group U, = {1, 3, 5, 7} of units in Z; is isomorphic to the 
additive group Z2 X Zz. To prove this, let f: Us —> Z2 X Z, be defined by 
fl) =(©,0) #3) = (1,0) KS) =(0,1) fM = (1,1). 


Clearly fis a bijection. Showing that f(ab) = f(a) + f(b) for a, b € U; is equiva- 
lent to showing that the operation table for Z, X Z, can be obtained from that 
of U; simply by replacing each ae U; by f(a)€ ZX Z, -Use the tables below to 
verify that this is indeed the case. Therefore, f is an isomorphism: 

U; Z, X Z, 
+ |oo (1,0) ©1) a9 


(0,0) | (0,0) (1,0) (0,1) (1,1) 
(1,0) | (1,0) (0,0) (1,1) (0,1) 
(0,1) | (0,1) (1,1) (0,0) (1,0) 
(1,1) |} (1,1) (0,1) (1,0) (0,0) 


æ V v Nfs 


1 
1 
3 
5 
al 


Nn WD = 
nr N Uju 
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EXAMPLE 3 


Let E be the additive group of even integers. We claim that f: Z — E given by 
f(a) = 2a is an isomorphism. Since Z and £ are infinite, comparing tables is 
not an option. However, the formal definition of isomorphism will do the job. 
We begin by showing that fis injective.* Suppose a, b € Z and f(b) = f(b) in E. 


Then 
SD) = fib) 
2a = 2b [Definition of f ] 
a=b [Divide both sides by 2.] 


Hence, f is injective. Now suppose n E E. Since n is an even integer, n = 2k for 
some integer k. Therefore, f(k) = 2k = n, and f is surjective. Finally, for all a, 
bez, 


fla + b) = 2(a +b) = 2a + 2b = fla) + f(b). 


Hence, fis an isomorphism of additive groups. 


EXAMPLE 4 


The additive group R of real numbers is isomorphic to the multiplicative group 
R** of positive real numbers. To prove this, let i R-+R** be given by f(r) = 10. 
To show that fis injective, suppose that 


f(r) = f(s). 
Then 
10 = 10° [Definition of f ] 
log 10” = log 10° [Take logarithms of both sides.] 
r=s [Basic property of logarithms] 


So fis injective. To prove that f is surjective, let kE R. Then r = log k is a real 
number, and by the definition of logarithm, 


fir) = 10 = 10"8* = k, 
Thus, f is also surjective. Finally, 
fir +s) = 10% = 1010 = f(r) f(s). 


Therefore, fis an isomorphism and R = R**, 


“Injective, surjective, and bijective functions are discussed in Appendix B. 
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EXAMPLE 5 


Two finite groups with different numbers of elements (such as Z5 and 29) can- 
not be isomorphic, because no function from one to the other can be a bijection. 


Example 1 presented two groups with the same number of elements that were 
isomorphic. However, this is not always the case. 


EXAMPLE 6 


S, and the additive group Z, each have order 6, but are not isomorphic. There is 
no way to relabel the addition table of Z, to obtain the table of S, because the 
operation in S, is not commutative, but addition in Z, is. A similar argument in 
the general case (see Exercise 16) shows that for groups G and H, 


If Gis abelian and H is nonabelian, then Gand H are not isomorphic. 
EXAMPLE 7 


The additive groups Z, and Z, X Z, each have order 4 but are not isomorphic 
because every nonzero element of Z3 X £2 has order 2, but Z4 has two elements 
of order 4 (namely, 1 and 3). So relabeling the addition table of one cannot 
produce the table of the other. More generally by Exercise 29, 


If f is an isomorphism, then a and f(a) have the same order. 


If Gisa group, then an isomorphism G— Gis called an automorphism of the group G. 


EXAMPLE 8 


If Gis a group, then the identity map ug:G — G given by tg(7) = r is an auto- 
morphism of G. It is clear that ig is bijective, and for any a, bE G, 


g(a * b) = a * b = tg(a) * tg(b). 
EXAMPLE 9 


Let c be a fixed element of a group G. Define f: G — G by f(g) = ege, 
Then 


f(b) f(b) = (aoibhe) = calee™)be = abe = f(ab). 
If gE G, then cgc! € G and 
Slege) = eT (ege™e = fe“ le)g(c'c) = ege = g. 
So fis surjective. To show that f is injective, suppose f(a) = f(b). Then ¢!ae = 
c~'be. Canceling ¢ on the right side and c™' on the left side by Theorem 7.5, we 
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have a = b. Hence, fis injective. Therefore, fis an isomorphism, called the inner 
automorphism of G induced by c. For more about automorphisms, see Exercises 36, 
37, 58, and 59. 


The next theorem completely characterizes all cyclic groups. 


Theorem 7.19 


Let G be acyclic group. 


(1) If G is infinite, then G is isomorphic to the additive group Z. 
(2) If G is finite of order n, then G is isomorphic to the additive group Z,,. 


Proof » (1) Suppose that G = (a) is an infinite cyclic group. By Theorem 7.15 G 
consists of the elements a* with k €Z, all of which are distinct (meaning 
that d = a’ if and only if i = j). The function f:G — Z defined by f(a) = k 
is easily seen to be a bijection (Exercise 17). Since 
S (aa!) = f+ =i+ 7 =f@) + a), 
fis an isomorphism. Therefore, G = Z. 


(2) Now suppose that G = (b) and b has order n. By Theorem 7.15, 
G = {b°, b', b,..., b” — !} , and by Corollary 2.5, Z, = {[0], [1], (2],.-.. 
[n — 1]}. Define g:G > Z, by g(b') = [i]. Clearly g is a bijection. Finally, 
gH) = g+ = [i +s] = [+L = g) + gH). 


Hence, g is an isomorphism and G = Z,. m 


EXAMPLE 10 
In multiplicative group Q* of nonzero rational numbers, the cyclic subgroup 
w e ahh 
i = , 1, 2,4, ET 
generated by 2 is (2) fasa TEET 2,4, 8, 16, anit he 


group (2) is isomorphic to the additive group Z by Theorem 7.19. 


EXAMPLE 11 


The upper left-hand quadrant of the operation table for D, in Example 5 
of Section 7.1 or 7.1.A and Theorem 7.12 show that G = {r9, ři, r2, 73} is 
a subgroup of D,. Verify that both Gand U; = {1, 2, 3, 4} are cyclic. By 
Theorem 7.19 each is isomorphic to the additive group Z4. Hence, they are 
isomorphic to each other: G = U, (Exercise 21). 


“Exercise 7 of Section 7.3. 
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Homomorphisms 


Many functions that are not injective or surjective satisfy condition (iii) of the defini- 
tion of isomorphism. Such functions are given a special name and play an important 
role in later sections of this chapter. 


Definition Let G and H be groups (with operation «). A function f:6 — H is said to be 
a homomorphism if 


fla xb) = fla) x f(b) for alla, bes. 


Every isomorphism is a homomorphism, but a homomorphism need not be an 
isomorphism. 


EXAMPLE 12 


The function /:R* — R* given by f(x) = x? is a homomorphism of multiplica- 
tive groups because 


Sf (ab) = (aby = ab? = f (b) f(b). 


However, f is not injective because f (1) =f(—1) and is not surjective because 
f(x) = x = 0 for all x, so no negative number is an image under f. 


EXAMPLE 13 


The function £Z — Z; given by f(a) = [a] is a homomorphism of additive 
groups because 


f(a + b) = [a + b] = [a] + [5] =f) =f. 
The homomorphism fis surjective, but not injective (Why?). 


EXAMPLE 14 


If Gand Hare groups, the function fG X H > G given by f((x, y)) = xisa 
surjective homomorphism (Exercise 9). If H is not the identity group, g is not 
injective. For instance, if ey + a € H, then (eg, a) # lee, ey) in G X H, but 

I ((eq. 2) = eg and f((eg, ex) = ee- 


Recall that the image of a function fG — H is a subset of H, namely Im f = 
{he H|h = f(a) for some a € G}. The function f can be considered as a surjective map 
from G to Im f. 
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Theorem 7.20 


Let G and H be groups with identity elements eg and ey, respectively. If 
f:G = H is a homomorphism, then 


(1} f(@c) = ey. 

(2) fla") = flay" for every acG. 
(3) Im f is a subgroup of H. 

(4) If fis injective, then G = Im f. 


Proof > (1) Since fis a homomorphism, eg is the identity in G, and ey is the 
identity in H, we have 


Sleadf(ea) =f(ecea) [f is a homomorphism] 

flea) flea) = fa) [eg is the identity in G.] 

Sled f (ea) = enf (ea)  [f(ea) EH and ey is the identity in H.] 
Canceling f (eg) on the right (by Theorem 7.5) produces f (eg) = ey. 
(2) By (1) we have 


FEDS) = f(a) = f (e6) = ex = f(@) FC). 
Canceling f (a) on each end shows that f(a~') = f(a)". 

(3) The identity ege Im f by (1), and so Im f is nonempty. Since 
S@f©@) = f (ab), Im f is closed. The inverse of each f(a) € Im fis also in 
Im f because f(a)! = f(a~') by (2). Therefore, Im fis a subgroup of H by 
Theorem 7.11. 


(4) As noted before the theorem, f can be considered as a surjective 


function from G to Im f. If fis also an injective homomorphism, then f is an 
isomorphism. @ 


Group theory began with the study of permutations and groups of permutations. 
The abstract definition of a group came later and may appear to be far more general 


than the concept of a group of permutations. The next theorem shows that this is 
not the case, however. 


Theorem 7.21 Gayley’s Theorem 


Every group G is isomorphic to a group of permutations. 


Proof > Consider the group A(G) of all permutations of the set G. Recall that 
A(G) consists of all bijective functions from G to G with composition as 
the group operation. These functions need not be homomorphisms. 
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To prove the theorem, we find a subgroup of A(G) that is isomorphic to 
G.* We do this by constructing an injective homomorphism of groups 
f:G— A(G); then G is isomorphic to the subgroup Im f of A(G) by 
Theorem 7.20. 
If a€ G, then we claim that the map 9,:G— G defined by g(x) = ax is 
a bijection of sets [that is, an element of A(G)]. This follows from the fact 
that if be G, then 9,(a 'b) = a(a'b) = b; hence, œ, is surjective. If (b) = 
fc), then ab = ac. Canceling a by Theorem 7.5, we conclude that b = c. 
Therefore, ~, is injective and, hence, a bijection. Thus g, E€ A(G). 
Now define fG —> A(G) by f (a) = Fa. For any a, b E G, f(ab) = ga is 
the map from G to G given by 9,,(x) = abx. On the other hand, f(a) ° f(b) = 
a ° ©» is the map given by (p, ° g(x) = PLPL) = G{bx) = abx. Therefore, 
S(@®) = f (a)° f(b)and fis a homomorphism of groups. Finally, suppose 
f(a) = f (c), so that gx) = e{x) for all x E G. Then a = ae = ofe) = of) = 
ce = c. Hence, f is injective. Therefore, G = Im f by Theorem 7.20. E 


Corollary 7.22 


Every finite group G of order n is isomorphic to a subgroup of the symmetric 
group Sp. 


Proof > The group Gis isomorphic to a subgroup H of A(G) by the proof of 
Theorem 7.21. Since G is a set of n elements, A(G) is isomorphic to S, 
by Exercise 38. Consequently, H is isomorphic to a subgroup K of S, by 
Exercise 22. Finally, by Exercise 21, G= H and H = KimplythatG=K. E 


Any homomorphism from a group G to a group of permutations is called a 
representation of G, and G is said to be represented by a group of permutations. The 
homomorphism G —> 4 (G) in the proof of Theorem 7.21 is called the left regular repre- 
sentation of G. By the use of such representations, group theory can be reduced to the 
study of permutation groups. This approach is sometimes very advantageous because 
permutations are concrete objects that are readily visualized. Calculations with per- 
mutations are straightforward, which is not always the case in some groups. In certain 
situations, group representations are a very effective tool. 

On the other hand, representation by permutations has some drawbacks. For one 
thing, a given group can be represented as a group of permutations in many ways—the 
homomorphism G— A(G) of Theorem 7.21 is just one of the possibilities (see Exercises 
49, 51, and 54 for others). And many of these representations may be quite inefficient. 
According to Corollary 7.22, for example, every group of order 12 is isomorphic to a 
subgroup of Sj, but Si; has order 12! = 479,001,600. Determining useful information 
about a subgroup of order 12 in a group that size is likely to be difficult at best. 

Except for some special situations, then, the study of elementary group theory via 
the abstract definition (as we have been doing) rather than via concrete permutation 
representations is likely to be more effective. The abstract approach has the advantage 
of eliminating nonessential features and concentrating on the basic underlying struc- 
ture. In the long run, this usually results in simpler proofs and better understanding. 


*The group A(G) itself is usually far too large to be isomorphic to G. For instance, if G has order n, 
then A(G) has order n! by Exercise 20 of Section 7.1. 
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E Exercises 


A. 1. (a) Show that the function £R > R given by f(x) = 3x is an isomorphism of 
additive groups. 


(b) Let R** be the multiplicative group of positive real numbers. Show that 
fi R** — R** given by f(x) = 3x is not a homomorphism of groups. 


2. Show that the function g:R** — R** given by g(x) = Vx is an isomorphism. 


3. Show that GL(2, Z3) is isomorphic to S, by writing out the operation tables 
for each group. [Hint: List the elements of GL(2, Z,) in this order: 


10 1 1 0 1 0 1 11 10 
i K (i ry (| D p a G y G a) and the elements 
angi 123 1 28 12 3 t23 
of S5 in this order: ( 2 iy i. 1 h € A 3 ( : i), 
£23 L23 ) 
a2 27 MAD 
4. Prove that the function {R* > R* defined by f(x) = X is an isomorphism. 


5. Prove that the function g:Z, — Zo defined by g(x) = 2x is an isomorphism. 


6. Prove that the function 4:Z, — Z; defined by A(x) = 2x isa homomorphism 
that is neither injective nor surjective. 


7. Prove that the function f‘R* — R** defined by f(x) = |x] is a surjective 
homomorphism that is not injective. 


8. Prove that the function g:R — R* defined by g(x) = 2* is an injective 
homomorphism that is not surjective. 


9. If Gand Hare groups, prove that the function iG x H — G given by f(a, b)) = 
a is a surjective homomorphism. 


10. Show that the function fR > R defined by f(x) = x? is not a homorphism. 


11. Prove that the function g:R* + GL(2, R) defined by g(x) = p J is an 
injective homomorphism. x 


0 
12. Prove that the function A:R + GL(2, R) defined by A(x) = C 4 is an 
injective homomorphism. F 


13. Show that U, is isomorphic to Uj. 


14. Prove that the additive group Z, is isomorphic to the multiplicative group of 
nonzero elements in Z,. 


15. Let iG — H be a homomorphism of groups. Prove that for each a € Gand 
each integern, f(a") = f(a)". 


16. If fG — H is a surjective homomorphism of groups and G is abelian, prove 
that H is abelian. 
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17. 
18. 


19. 


20. 


21. 


22. 


23. 


26. 


27. 


28. 


29. 


Prove that the function fin the proof of Theorem 7.19(1) is a bijection. 
Let G, H, G,, H, be groups such that G = G, and H = H,. Prove that 
GX H=G, XH, 


Prove that a group Gis abelian if and only if the function fG —> G given 
by f(x) = x7! is a homomorphism of groups. In this case, show that fis an 
isomorphism. 


Let N be a subgroup of a group G and let a E G. 

(a) Prove that a'Na = {a-'na | nE N} is a subgroup of G. 

(b) Prove that Nis isomorphic to a~'!Na. [Hint: Define f:N — a—'Na by 
f(n) = a'na] 

Let G, H, and K be groups. If G = H and H = K, then prove that G = K. 


[Hint: If f:G— H and g:H — K are isomorphisms, prove that the composite 
function g ° f:G— Kis also an isomorphism.] 


If G— H is an isomorphism of groups and if Tis a subgroup of G, prove 
that T is isomorphic to the subgroup f(T) = {fa) | ae T} of H. 


(a) If Gis an abelian group, prove that the function /?G — G given by 
f(x) = x is a homomorphism. 


(b) Prove that part (a) is false for every nonabelian group. [Hint: A counter- 
example is insufficient here (Why?). So try Exercise 24 of Section 7.2.] 


. Let G be a multiplicative group. Let G be the set G equipped with a new 


operation + defined by a x b = ba. 
(a) Prove that G” is a group. 
(b) Prove that G = G”. [Hint: Corollary 7.6 may be helpful.] 


. Assume that a and b are both generators of the cyclic group G, so that G = 


{a) and G = (b). Prove that the function f:G > G given by f(a‘) = b' is an 
automorphism of G. 


If G = (a) is a cyclic group and f:G — H is a surjective homomorphism of 
groups, show that f(a) is a generator of H, that is, H is the cyclic group ( f(a). 
[Hint: Exercise 15.] 


Let G be a multiplicative group and c a fixed element of G. Let H be the set G 
equipped with a new operation + defined by a + b = acb. 


(a) Prove that H is a group. 
(b) Prove that the map fG — H given by f(x) = c7'x is an isomorphism. 


Let f:G— H bea homomorphism of groups and suppose that a e G has finite 
order k. 


(a) Prove that f(a) = e. [Hint: Exercise 15.] 
(b) Prove that |f(a)| divides |a|. [Hint: Theorem 7.9.] 


If £G— H is an injective homomorphism of groups and a € G, prove that 


Lf(@)| = lal. 


(Copyright 2012 Cangngp Lemrning. All Rights Remrend. May not ba opid, «rene, or dapticand, in whole or ta part Dus to electronic cights, ame third party contest may de mppamad fen the eBook adur 


eChagmar(s). 
Gad that any ppw costed! does oct mammieity fisc the crani arming aapeiame Camgegy | cerning Garwa the rigit to suva efdixiconal costed! at eay time if abamu igit cemrictirms require it. 


‘Edimrial eview has 


7.4 lsomorphisms and Homomorphisms 225 


30. Let fG — H be a homomorphism of groups and let K bea subgroup of H. 
Prove that the set {ae G | f(a) E K} is a subgroup of G. 


31. If fG— Gis a homomorphism of groups, prove that F = {a EG | f(a) = a} is 
a subgroup of G. 


32. If A= (? 2) is a matrix, the number ad ~ bc is denoted det 4 and called 


the determinant of A. Prove that the function f: GŁ(2, R) > R* given by 
S(A) = det A is a surjective homomorphism. 


33. Let f:G + H be a homomorphism of groups and let K= {a EG | f(a) = ey}, 
that is, the set of elements of G that are mapped by f to the identity element 
of H. Prove that Kis a subgroup of G. See Exercises 34 and 35 for examples. 


34. The function f:Z — Z; given by f(x) = [x] is a homomorphism by Example 13. 
Find K; (notation as in Exercise 33). 


35. The function f: U; —> U; given by f (x) = x’ is a homomorphism by Exercise 23. 
Find K;(notation as in Exercise 33). 


36. Let G be a group and let Aut G be the set of all automorphisms of G. Prove 
that Aut G is a group under the operation of composition of functions. 
[Hint: Exercise 21 may help.] 

37. Let G be a group and let Aut G be as in Exercise 36. Let Inn G be the set of 
all inner automorphisms of G (that is, isomorphisms of the form f(a) = cae 
for some ¢ € G, as in Example 9.). Prove that Inn Gis a subgroup of Aut G. 


[Noze: Two different elements of G may induce the same inner automorphism, 
that is, we may have e~!ac = d-!ad for all ae G. Hence, [Inn G = |G. 


38. Let T be a set n elements and let A(T) be the group of permutations of T 
Prove that A(T) = S,- [Hint: If the elements of T in some order are relabeled as 
1, 2, . . . , n, then every permutation of T becomes a permutation of 1, 2,..., n.] 


39. Show that the additive groups Z and Q are not isomorphic. 


In Exercises 40-44, explain why the given groups are not isomorphic. (Exercises 16 


and 29 may be helpful.) 
40. Zand $; 41. Z, X Z, and D, 
42. Z, x Z, and Z, x Z, x Z, 43. Us and Uio 4. Uo and Un 


45. Is U; isomorphic to U,,? Justify your answer. 


46. Prove that the additive group R of all real numbers is not isomorphic to the 
multiplicative group R* of nonzero real numbers. [Hint: If there were an 
isomorphism fR — R*, then k) = —1 for some k; use this fact to arrive at a 
contradiction.] 


47. Show that D, is not isomorphic to the quaternion group of Exercise 16 of 
Section 7.1. 


48. Prove that the additive group Q is not isomorphic to the multiplicative group 
Q** of positive rational numbers, even though R and R** are isomorphic. 
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49. Let G be a group and let A(G) be the group of permutations of the set G. 
Define a function g from G to A(G) by assigning to each dE G the inner 
automorphism induced by d~ (as in Example 9 with c = d~'), Prove that gis 
a homomorphism of groups. 


50. Let G be a group and hk E A(G). Assume that ho ¢, = 9, °h for all a E G 
(where ¢, is as in the proof of Theorem 7.21). Prove that there exists b € G 
such that A(x) = xb~! for all x € G. 


51. (a) Let Gbea group and ec E G. Prove that the map 6,:G — G given by 
0,(x) = xe! is an element of A(G). 


(b) Prove that h: G—» A(G) given by A(c) = 6, is an injective homomorphism 
of groups. Thus G is isomorphic to the subgroup Im A of A(G). This is the 
right regular representation of G. 


52. Find the left regular representation of each group (that is, express each group 
as a permutation group as in the proof of Theorem 7.21): 


(a) Z, b) Z {c) S3 

53. Let f:G— H be an isomorphism of groups. Let g:H — G be the inverse 
function of f as defined in Appendix B. Prove that g is also an isomorphism of 
groups. [Hint: To show that g(ab) = g(a)g(b), consider the images of the left- 


and right-hand sides under fand use the facts that fis a homomorphism and 
f° gis the identity map.] 


54. (a) Show that D; = S}. [Hint: D, is described in Example 6 of Section 7.1 
or 7.1.A. Each motion in D, permutes the vertices; use this to define a 
function from D; to S;3.] 


(b) Show that D, is isomorphic to a subgroup of S,. [Hint: See the hint 
for part (a). This isomorphism represents D4, a group of order 8, asa 
subgroup of a permutation group of order 4! = 24, whereas the left 
regular representation of Corollary 7.22 represents G as a subgroup 
of Ss, a group of order 8! = 40,320.] 


(Aan =n 
. (a) Prove that H= {( i ih 


nm 
nm 


ne z} is a group under matrix 


multiplication. 
(b) Prove that H = Z. 


1 —2n n 
56. (a) Prove that K = {( i 4 il 


ne z} is a group under matrix 
multiplication. 
(b) Is K isomorphic to Z? 


57. Prove that the additive group Z[x] is isomorphic to the multiplicative group 
Q** of positive rationals. [Hint: Let Po, Pi, P2, . . . be the distinct positive 
primes in their usual order. Define o:7[x] > Q** by 


Play + ax + ax? + +++ + a,x") = pepi +++ py] 
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58. Prove that Gis an abelian group if and only if Inn G consists of a single 
element. [Hint: See Exercise 37] 


59. (a) Verify that the group Inn D; has order 4. [Hint: See Exercise 37.] 
(b) Prove that Inn D, = Z, X Z} 


60. Prove that Aut Z = #,. [Hint: What are the possible generators of the cyclic 
group Z? See Exercises 25 and 26.] 


61. Prove that Aut Z, = U,,. [Hint See Exercise 25 above and Exercise 44 of 
Section 7.3.] 


62. Prove that Aut (Z, X Z3) = S} 


APPLICATION: Linear Codes (Section 16.1) may be covered at this point 
if desired. 


75 The Symmetric and Alternating Groups* 


The finite symmetric groups S, are important because, as we saw in Corollary 7.22, 
every finite group is isomorphic to a subgroup of some S,. In this section, we introduce 
a more convenient notation for permutations, and some important subgroups of the 
groups S,. We begin with the new notation. 
123456 
143625 
is mapped to 6, 6 is mapped to 5, 5 is mapped back to 2, and the other two elements, 
1 and 3, are mapped to themselves. All the essential information can be summarized 
by this diagram: 


Consider the permutation ( ) in S,. Note that 2 is mapped to 4, 4 


a, a 


It isn’t necessary to include the arrows here as long as we keep things in the same order. 
A complete description of this permutation is given by the symbol (2465), with the 
understanding that 


each element is mapped to the element listed immediately to the right; 
the last element in the string is mapped to the first; 


elements not listed are mapped to themselves. 


“Except for a few well-marked examples and exercises, this section is needed only in Sections 8.5, 
9.3-9.5, and 12.3. 


(Copyright 2012 Caagags Lemming AX Right Ramrvat May act be pied, erama, or duplicat, in winds or ta part Dus to efecronic rights, nme thicd party contest may be mppmaai fmn the eBook zefor oChagiar{s). Edimvial eise has 
@eemed that moy appead content dows oct matacaly affect the oyani Luring eapeeare Cmecge Learning marvas the right w sarve iddaosi content at any the if edeexquew) rights cesyictime ragair it. 


228 Chapter? Groups 


This is an example of cycle notation. Here is a formal definition. 


Definition Let 2, 2, @ ..., 2 (with k = 1) be distinct elements of the set {1, 2, 3, ..., A}. 
Then (aa ~ . . ax) denotes the permutation in $, that maps a, to a», a to 
ay... Ag ÉO â% and a, tO 2, and maps every other element of {1, 2, 3, ...4,.} 
to itself. (a,a)a,...a) is called a cycle of length & or a k-cycle. 


EXAMPLE 1 


In S4, (143) is the 3-cycle that maps 1 to 4, 4 to 3, 3 to 1, and 2 to itself; it was 


4213 
(431) or (314) since each of these indicates the function that maps | to4, 4 to 3, 
3 to l,and2 to2. 


1234 
written ( ) in the old notation. Note that (143) may also be denoted by 


EXAMPLE 2 


According to the definition above, the 1-cycle (3) in S, is the permutation that 
maps 3 to 3 and maps every other element of {1, 2, , . . , #} to itself; in other 
words, (3) is the identity permutation. Similarly, for any k in {1, 2, ..., #}, the 
1-cycle (k) is the identity permutation. 


Strictly speaking, cycle notation is ambiguous since, for example, (163) might de- 
note a permutation in Sę, in S}, or in any S, with n = 6. In context, however, this 
won't cause any problems because it will always be made clear which group S, is under 
discussion. 

Products in cycle notation can be visually calculated just as in the old notation. For 
example, we know that 


12S Ay fl 22°45 _ ft 24a 

L ears. 41 6B 4312/7 
(Remember that the product in S, is composition of functions, and so the right-hand 
permutation is performed first.) In cycle notation, this product* becomes 


Ed yf 3 4 a= 4 2 3). 
Se 


The arrows indicate the process: 1 is mapped to 2 and 2 is mapped to 4, so that the 
product maps 1 to 4. Similarly, 4 is mapped to 3 and 3 is mapped to 2, so that the 
product maps 4 to 2. 


*Hereafter we shallomitthecomposition symbol» andwritethe group operationin S, multiplicatively. 
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EXAMPLE 3 


In the old notation S, consists of 


Fet o Hre ad H 
12 3u 3 als 2 Fu 1 aye s 1 fs 1 af 


In the new notation, the elements of S, (in the same order) are 


(1), (23), (13), (12), (123), and (132). 


Two cycles are said to be disjoint if they have no elements in common. For instance, 
(13) and (2546) are disjoint cycles in S¢, but (13) and (345) are not since 3 appears in 
both cycles. 


EXAMPLE 4 


As shown before Example 3, (243)(1243) = (1423). Verify that 

(1243)(243) = (2341). 
Hence, the cycles (243) and (1234) do not commute with each other. On the other 
hand, you can easily verify that the disjoint cycles (13) and (2546) do commute: 


(13)(2546) = (3 : i 3 i >) = (2546013). 


This is an illustration of the following theorem. 


Theorem 7.23 


Ifø = (2a) + «+ a) and T = (b,b; * + + b,) are disjoint cycles in Sp, then a7 = ta.* 


Proof» Exercise 18. m 


It is not true that every permutation is a cycle, but every permutation can be 
expressed as the product of disjoint cycles. Consider, for example, the permutation 
1234567 
fe 1724 63 
where it is sent by the permutation: 


) in S;. Find an element that is not mapped to itself, say 1, and trace 


1 is mapped to 5, 5 is mapped to 4, 4 is mapped to 2, and 
2 is mapped to 1 (the element with which we started). 


*Greek letters are often used to denote permutations. We shall generally use the letters alpha (a), 
beta (8), delta (8), sigma (g), and tau (7). For the entire Greek alphabet, see the inside back cover of 
this book. 


Copmtgte 2012 Compe» Leasing AN Siaka Ramen May aot ba cosied essaed, or anpticmed, in wocie oc tm pect. Due to electronic dadi», mme third party coutalt may be axppmmat fren the eBook and/or eChagiar(s). Edumrial review has 
that any uppa osteo dona oot manedaDy affect the ovaali aming anperiama. Camgige Learming casarves tha right w Seve efidiional oant at any thne if shampona rights cescrictinas require it. 


230 Chapter7 Groups 


Thus the given permutation has the same action as the cycle (1542) on these four 
elements, Now look at any element other than 1, 5, 4, 2 that is not mapped onto itself, 
say 3. Note that 


3 is mapped to 7, and 7 is mapped to 3. 


Thus the 2-cycle (37) has the same action on 7 and3 as the given permutation. The only 
element now unaccounted for is 6, which is mapped to itself. You can now easily verify 
that the original permutation is the product of the two cycles we have found, that is, 


T2345 6 7 
( LERF 7) = (1542)(37), 


Although some care must be used and the notation is more cumbersome, essentially 
the same procedure works in the general case. 


Theorem 7.24 


Every permutation in S, is the product of disjoint cycles.* 


Proof» Adapt the procedure in the preceding example; see Exercise 44. W 


Theorem 7.29 


The order of a permutation 7 in S, is the least common multiple of the lengths 
of the disjoint cycles whose product is r.t 


Proof» Exercise 19. m 
EXAMPLE 5 
The permutation T = (12)(34)(567) is a product of disjoint cycles of lengths 2, 2, 


and 3. The least common multiple of 2, 2, and 3 is 6. Theorem 7.25 tells us that 
7 has order 6. You can verify this directly by computing the powers of T: 


T = (12)(34)(567), T? = (576), T = (12)(34), 
T* = (567), Tê =(12)(34)(576), 7S=(). E 
The Alternating Groups 


A 2-cycleis often called a transposition. Transpositions have some interesting properties. 


EXAMPLE 6 


If (ab) is a transposition, verify that (ab)(ab) = (1). Hence, 


Every transposition is its own inverse. 


*As usual, we allow the possibility of a product with just one cycle in it. 
tThe least common multiple is defined in Exercise 31 of Section 1.2. 
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EXAMPLE 7 


We claim that the inverse of the product (12)(34)(14)(13) is (13)(14)(34)(12) (the 
same transpositions in reverse order). To prove this claim, we use the fact that a 
transposition is its own inverse: 


(12)(34)(14)(13) « (13)(14)(34)(12) = (12)(34)(14) - (14)(34)(12) 
= (12)(34) » (34)(12) = (12)(12) = (1). 
A similar argument works in the general case and shows that 
If Ci Ox Oy, ++ 5 Fy_1, and g, are transpositions, then 


1 — a 
(T0103 ° On- 10a) 5 OTa O30201- 


You can easily verify that 
(1) = (12)(12), (123) = (12)(23), (1234) = (12)(23)(34). 


These are examples of the following theorem. 


Theorem 7.26 


Every permutation in S, is a productof(notnecessarily disjoint) transpositions. 


Proof» Since every permutation is a product of cycles by Theorem 7.24, we need 
only verify that every cycle (ajaz * + + a,) is a product of transpositions: 


(aya, * + * az) = (aya)(a0) + + + (ak 14). M 
This corollary can also be proved directly by induction, without using Theorem 7.24 
(Exercise 33). 
A permutation in S, is said to be even if it can be written as the product of an 


even number of transpositions, and odd if it can be written as the product of an odd 
number of transpositions. 


EXAMPLE 8 


(132) is even and (1243)(243) is odd because, as you can easily verify, 
(132) =(12)(13) and = (1243)(243) = (23)(34)(14). 


Since no integer is both even and odd, the even-odd terminology for permutations 
suggests that no permutation is both even and odd. This is indeed the case, but it 
requires proof. The first step is to prove 


Lemma 7.27 


The identity permutation in S, is even, but not odd. 


Proof » We write the identity permutation as (1). Verify that (12)(12) = (1). 
Hence, the identity permutation is even. To show that it is not odd, we 
use a proof by contradiction. Suppose that (1) = Tg * * * TaT; with each 7, 
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a transposition and k odd. Let c be a symbol that appears in at least one 
of these transpositions. Let 7, be the first transposition (reading from 
right to left) in which c appears, say T, = (cd). Then c does not appear in 
T,-1:'‘* T, and is, therefore, left fixed by these transpositions If r = k, 
then c is left fixed by all the 7’s except Ty so that the product—the iden- 
tity permutation—maps ¢ to d, a contradiction. Hence, r < k. 

Now consider the transposition 7,, ,. It must have one of the follow- 
ing forms (where x, y, c, d denote distinct elements of {1, 2, ++ + n$: 


I. (xy) IL. (xd) III. (cy) IV. (cd). 
Consequently, there are four possibilities for the product 7,, )7,: 
I. (xy)(ed) II. (xd)(ed) HI. (ey)(ed) IV. (cd)(cd). 


In Case I, verify that (xy)(cd) = (cd)(xy). Replace (xy)(cd) by (cd)(xy) in 
the product; this moves the first appearance of ¢ one transposition to the 
left. In Case TI, verify that (xd)(ed) = (xc)(xd); if we replace (xd)(cd) by 
(xe)(xd), then once again the first appearance of ¢ is one transposition far- 
ther left. Show that a similar conclusion holds in Case III by verifying that 
(y)cd) = (eddy). 

Each repetition of the procedure in Cases I—III moves the first ap- 
pearance of c one transposition farther left. Eventually Case IV must 
occur; otherwise, we could keep moving c until it first appears in the last 
permutation at the left, TŁ, which is impossible, as we saw in the first para- 
graph. In Case IV, however, we have T,417, = (cd)(cd) = (1). So we can 
delete these two transpositions and write (1) as a product of two fewer 
transpositions than before. Obviously, we can carry out the same argu- 
ment for any symbol that appears in a transposition in the product. If the 
original product contains an odd number of transpositions, eliminating 
two at a time eventually reduces it to a single transposition (1) = (ad), 
which is a contradiction. Therefore, the identity permutation (1) cannot 
be written as the product of an odd number of transpositions. W 


Theorem 7.28 


No permutation in S, is both even and odd. 


Proof» Suppose a E S, can be written as C102 * * * Og and as T;T2 + ++ T, with 
each g;, Ty a transposition, k odd, and r even. Since every transposition is 
its own inverse, Corollary 7.6 shows that 


(1) = aa = (1+ on) (1+ TN 


= O° OT," Ty 


Since k is odd and r is even, k + r is odd, and we have written (1) as the 
product of an odd number of transpositions. This contradicts Lemma 7.27, 
and completes the proof of the theorem. W 
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The set of all even permutations in S, is denoted A, and is called the alternating 
group of degree n; the word “group” is justified by the following theorem. 


Theorem 7,29 


A, is a subgroup of S, of order nl/2. 


Proof» 1f œ and Bare in A,, then a = 0103' * ‘Op and B = TiTa auia with each 
On Ta transposition and k, r even. Thus, a8 = 010° ** OKTiT2* ** Tp 


Since k + ris even, aß E Ap 2 A, is closed under multiplication, By 


Example 7, a7! = 0gp 1 * * * 2). Since k is even, a~’ € A,. Therefore, 


A, is a subgroup by Theorem 7.11. Exercise 24 shows that |A,| = #!/2. E 


EXAMPLE 9 


The elements of S, are listed in Example 3. Because |S;| = 3!, we know that 
|A3| = - = 3. Since (12), (13), and (23) are obviously odd, A; must consist of 
(123), (132), and (1). 


E Exercises 


A. 1. Write each permutation in cycle notation: 


123456789 123456789 
@) 721456389 ©) 243576891 
) 123456789 () 123456789 
C 481752639 125476938 
2. Compute each product: 
(a) (12)(23)(34) (b) (246)(147)(135) 
(e) (12)(53214)(23) (d) (1234)(2345) 
3. Express as a product of disjoint cycles: 
123456789 123456789 
(@) Pri tr ©) Ce era 
123456789 
d 
(O) < 5124987 o (d) (14)(27)(523)(34)(1472) 


(e) (7236)(85)(571)(1537)(48 6) 


4. Write each permutation in Exercise 3 as a product of transpositions. 
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5. 


20. 


Find the order of each permutation. 


(a) (12) (b) (123) (c) (1234) 
(d) What do you think the order of (123456789) is? 


. Find the order of each permutation. 


(a) (13)(24) (b) (123)(456) (c) (123)(435) 
(d) (1234)(4231) (e) (1234)(24)(43215) 


. Which of these permutations are even: 


(a) (2468) (b) (246)(134) (c) (12)(123)(1234) 


. List the elements in each group: 


(a) 4 (b) Ay 


. What is the order of each group: 


(a) 44 (b) As (c) Ato 


. Is the set B, of odd permutations in S,, a group? Justify your answer. 
. List the order of each element of 44. 

. Write (12)(34) as the product of two 3-cycles. 

. Show that a = (123)(234)(567)(78910) has order 10 in S, (n = 10). 


[Hint: Write œ as a product of disjoint cycles and use Theorem 7.25.] 


. Show that 6 = (1236)(5910)(465)(5678) has order 21 in S, (n = 10). 

. Prove that the cycle (aja; - > - a) is even if and only if k is odd. 

. Show that the inverse of (aja; + + + ay) in S, is (aag; * * * aza). 

. Prove that a k-cycle in the group S, has order k. 

. Let g = (aya, + + + ay) and T = (b,b - - - b,) be disjoint cycles in S,. Prove that 


oT = To. [Hint: You must show that ot and to agree as functions on each i 
in {1, 2,..., n}. Consider three cases: ij is one of the a’s; jis one of the 5’s; iis 
neither.] 


. Prove Theorem 7.25: The order of a permutation 7 in S, is the least common 


multiple of the lengths of the disjoint cycles whose product is T. 
[Hint: Theorem 7.23 and Exercise 17 may be helpful.] 


Let a and £ be permutations in S,. 
(a) Fill the blanks in the table. 


(b) What conclusions can you draw from the results in part (a). 
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21. 


23. 


26. 
27. 
. Let g be a k-cycle in S,,. 


32. 


33. 


. Let 7 be a transposition and let g E€ S,. Prove that ara” 
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Find the order of g!™, where ø is the permutation 


¢ 2345678 A 
[Hint: Write g as a product of disjoint cycles.] 


378945216 


. Show that So contains elements of orders 10, 20, and 30. Does it contain an 


element of order 40? 


Prove that {(1), (12)(34), (13)(24), (14)(23)} is a subgroup of 44. 


. Let B, denote the set of odd permutations in S,. Define a function f:A, > B, 


by fla) = (12)a. 
(a) Prove that fis injective. 


(b) Prove that fis surjective. [Hint: If B E€ B,, then (12)8 €A,.] 
So fis bijective. Hence, 4, and B, have the same number of elements. 


(c) Show that |A,| = !/2. [Hint: Every element of S, is in A, or B, (but not 
both) and|S,| = aL] 


See Exercise 39(a) and (b) for a generalization of this exercise. 


. Show that the subgroup G of S4 generated by the elements g = (1234) and 


T = (24) has order 8. 
Prove that the center of S, (n > 2) is the identity subgroup. 
If ø is a k-cycle with k odd, prove that there is a cycle T such that 77 = g. 


(a) Prove that a” is a cycle if and only if k is odd. 
(b) If k = 2t, prove that there are t-cycles T and £ such that o = 7B. 


. Let g and 7 be transpositions in S, with n = 3. Prove that ør is a product of 


(not necessarily disjoint) 3-cycles. 


. Prove that every element of A, is a product of 3-cycles. 
. Let g be a product of disjoint cycles of the same length. Prove that g isa 


power of a cycle. 


Prove that the decomposition of a permutation as a product of disjoint cycles 
is unique except for the order in which the cycles are listed. 


Use induction on n to give an alternate proof of Theorem 7.26: Every 
element of S, is a product of transpositions. [Hint: If the statement is true 
for n = k — 1 and if T E S;, consider the transposition (kr), where r = 7(k). 
Note that (kr)r fixes k and hence may be considered as a permutation of 
AM i ccpaceke N] 


. If n = 3; prove that every element of S, can be written as a product of at most 


n — | transpositions. 


1 is a transposition. 


is 


. If 7 is the k-cycle (ajaz «+ + a,) and if g E S, prove that ota" = 


(a(a;)o(a2) - + + o(a). 
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37. Let H consist of all permutations in S, that fix 1 and n, that is, 
H = {aeES,|a() = l and a(n) = n}. 
Prove that H is a subgroup of S,,. 

38. Show that D, is isomorphic to the group G in Exercise 25. [Hint: Note that 
every element of D, produces a permutation of the vertices of the square 
(see Example 5 in Section 7.1 or 7.1.A.). If the vertices are numbered 1, 2, 
3, 4, then this permutation can be considered as an element of S,. Define a 


function f:D4 —> S4 by mapping each element of D; to its permutation of the 
vertices. Verify that fis an injective homomorphism with image G.] 


39. Let G be a subgroup of S, that contains an odd permutation T. 


(a) Prove that the number of even permutations in G is the same as the 
number of odd permutations in G. 
(b) Explain why 2 divides |G]. 
(c) If Kis a subgroup of S, of odd order, prove that K is actually a subgroup 
of A, 
C.40. Prove that every element of A, is a product of #-cycles. 
41. Prove that the transpositions (12), (13), (14), . . . , (1m) generate S,,. 
42. Prove that (12) and (123 - : - n) generate S,. 
43. If fis an automorphism of S;, prove that there exists ¢ E S, such that 
KT) = ora for every TE S}. 
44. Use the following steps to prove Theorem 7.24: Every permutation 7 in S, is a 
product of disjoint cycles. 


(a) Let a, be any element of {1, 2, . . . , n} such that r(a,) # a,. Let a, = T(a)), 
a; = T(ay), ay = T(a;), and so on. Let k be the first index such that t(a,) is 
one of a;,..., ap — 4. Prove that 7(a,) = a,. Conclude that 7 has the same 
effect on a, ... , a as the cycle (ajaz * + * ap). 


(b) Let 5; be any element of {1, 2, .. ., n} other than a;, . . . , a, that is not 
mapped to itself by 7. Let b, = 7(5,), b3 = T(b2), and so on. Show that 
7(b,) is never one of ay, . . . , ay. Repeat the argument in part (a) to find a 
b, such that 7(b,) = b, and T agrees with the cycle (bib, - - - b,) on the b’s. 


(c) Let cı be any element of {1, 2,..., n} other than the a’s or b’s above such 
that 7(c;) # e,. Let c, = 7(c,), and so on. As above, find c, such that 7 
agrees with the cycle (c,c2 ° > - ¢,) on the c’s. 


(d) Continue in this fashion until the only elements unaccounted for are those 
that are mapped to themselves by 7. Verify that 7 is the product of the cycles 


(a, -apb + BY se) ee: 
and that these cycles are disjoint. 


45. Prove that S, is isomorphic to a subgroup of Anz 
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CHAPTER 8 


Normal Subgroups and Quotient Groups 


Congruence in the integers led to the finite arithmetics Z,, which produced 
a number of interesting results. Now we shall extend the concept of congru- 


ence to groups, producing new groups and a deeper understanding of algebraic 
structure, 


[eu] Congruence and Lagrange’s Theorem 


In this section we present the analogue for groups of the concept of congruence, 
which was introduced for integers in Chapter 2 and for rings in Chapter 6.* Except 
for some notational changes, the first three results of this section are virtually identical 
to those proved earlier for integers and rings. The following chart shows this parallel 
development. 


INTEGERS RINGS GROUPS 
Theorem 2.1 Theorem 6.4 Theorem 8.1 
Theorem 2.3 Theorem 6.6 Theorem 8.2 
Corollary 2.4 Corollary 6.7 Corollary 8.3 


We begin by looking at an example of congruence in Z from a somewhat different 
viewpoint. 


*Chapter 6 is nota prerequisite for this section, but it will be mentioned occasionally. Section 2.1 will 
be the model for the presentation here. 
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EXAMPLE 1 


In the integers, a = b (mod 4) means that 4 divides a — b, that is, that a — bisa 
multiple of 4. Let K be the set of all multiples of 4, so that 


K = {0, +4, +8, +12, ...}. 
Thus, 
a= b (mod 4) means a — bek. 


Note that K is actually a subgroup of Z (the additive cyclic subgroup generated 
by 4). Instead of thinking of congruence modulo the element 4, we can con- 
sider this as congruence modulo the subgroup K: 


a = b (mod K) means a-bek. 


Now let G be any group and K a subgroup of G. The last line of the preced- 
ing example could be used as a definition of congruence modulo K. However, we 
normally use multiplicative notation for groups. So we must translate the pro- 
posed definition and results from Section 2.1 into equivalent statements in multi- 
plicative notation.* The following dictionary may be helpful for this translation. 


ADDITIVE NOTATION MULTIPLICATIVE NOTATION 


a+b ab 

0 e 

oc eo 
a-b=a+(—bd) abt 


Thus, the additive statement a — bE K is equivalent to the multiplicative state- 
ment ab E K, and we have the following definition of congruence. 


Definition Let K bea subgroup of a group G and leta, be G. Then ais congruentto b 
modulo K [written a = b (mod KJ] provided that ab" eK. 


EXAMPLE 2 


Let K be the subgroup {ro ri, r2, 73} of D4. Then the operation table in Example 5 
of Section 7.1 or 7.1.A shows that d~! = dandhod-! = h o d = r, E K. Therefore, 
h = d (mod K). 


*There is a possibility of confusion here since integer multiplication is also defined. In carrying 
over congruence from integers to groups, we consider only the additive structure of the integers 
and ignore integer multiplication because the integers form an additive group, but not a 
multiplicative one. 
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Theorem 8.1 
Let K be a subgroup of a group G. Then the relation of congruence modulo 
Kis 
(1) reflexive: a = a (mod K) for allae G; 
(2) symmetric: if a = b (mod K), then b = a (mod K); 
(3) transitive: if a = b (mod K) and b = c (mod K), then a = c (mod K). 
The idea is to translate the proof of Theorem 2.1 to the present situation by chang- 
ing congruence mod » to congruence mod K and replacing statements such as “x is 
divisible by n” or “n | x” or “x = nt” with the statement “x € K”. We must also change 
additive notation to multiplicative notation by using the dictionary above. It’s straight- 
forward for parts (1) and (3), but a bit trickier for part (2), since integer addition is 
commutative, but the multiplicative operation in G may not be. 


Proof of Theorem 8.1 > (1) aa! = e and eE K. Hence, a = a (mod K). 
(2) a = b (mod K) means ab“ = k for some k € K. Therefore, by 
Corollary 7.6, 
k`! = (ab = (b7 1) 1 a`! ~ ba`!, 
Since K is a group, the inverse of an element of K is also in K. Reading 
the preceding line from right to left, we see that ba~! = k~! € K. Hence, 
b = a (mod K). 
(3) If a = b (mod K) and b = e (mod K), then by the definition of 
congruence, there are r, $s E€ K such that ab! = r and be"! = s. Therefore, 
(ab™)(be~') = rs 
ac™! = rs 
Thus, ac™! € K (because r and s are in K). Hence, a = c (mod K). m 
If K is a subgroup of a group Gand if a E G, then the congruence class of a modulo 
K is the set of all elements of G that are congruent to a modulo K, that is, the set 
{bEG|b = a (mod K)} = {be G| ba eK} 
={beG| ba! = k, with kE K}. 
Right multiplication by a shows that the statement ba™! = k is equivalent to b = ka. 
Therefore, the congruence class of a modulo Kis the set 
{beEG| b= ka, with ke K} = {ka|keK}, 
which is denoted Ka and called a right coset of K in G. In summary: 

The congruence class of a modulo K is the right coset Ka = {ka | with k € K}. 
When the operation in the group Gis addition, then a right coset is denoted K + a.* 
*For those who have read Section 6.1: Cosets of an ideal / in a ring were denoted a + / instead of 
ł + a. It didn't make any difference there because addition in a ring is commutative, soa + i=i+ta 


for every ie}. However, in Section 8.2 we shall see that when G is nonabelian, it is possible to have 
Ka + aK, where aK = {ak | with k € K}. 
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Theorem 8.2 


Let K be a subgroup of a group G and let a, ce G. Then a =c (mod K) if and 
only if Ka = Ke. 


> With minor notationa nges, the proof 1s essentially the same as that o 

Proof» with mi ional changes, the proof i ially th hat of 
Theorem 2.3. Just replace “mod x” with “mod K” and “[a]” with “Ka” and 
use Theorem 8.1 in place of Theorem 2.1. E 


Corollary 8.3 


Let K be a subgroup of a group G. Then two right cosets of K are either 
disjoint or identical. 


Proof» Copy the proof of Corollary 2.4 with the same notational changes as in 
the proof of Theorem 8.2. m 


Lagrange’s Theorem 

At this point we temporarily leave the parallel treatment of congruence in the integers 
and groups and use right cosets to develop some facts about finite groups that have no 
counterpart in the integers. 


Theorem 8.4 


Let K be a subgroup of a group G. Then 
(1) G is the union of the right cosets of K: G = Ka. 


(2) For each acG, there is a bijection fK —> Ka. Consequently, if K is 
finite, any two right cosets of K contain the same number of elements. 


Proof» (1) Since every right coset consists of elements of G, we have UKa SG 
If beG, then = eb Kb c U Ka, sothat Go U Ka. Hence, G= U Ka 


(2) Define f:K — Ka by f(x) = xa. Then by the definition of Ka, fis 
surjective. If f (x) = f(y), then xa = ya, so that x = y by Theorem 7.5. 
Therefore, f is injective and, hence, a bijection. Consequently, if K 
is finite, every coset Ka has the same number of elements as K, 
namely |K]. m 


If H is a subgroup of a group G, then the number of distinct right cosets of 
H in G is called the index of H in G and is denoted [G:H]. If G is a finite group, 
then there can be only a finite number of distinct right cosets of H; hence, the 
index [G:H] is finite. If G is an infinite group, then the index may be either finite 
or infinite. 
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EXAMPLE 3 


Let H be the cyclic subgroup (3) of the additive group Z. Then H consists of all 
multiples of 3, and the cosets of H are just the congruence classes modulo 3; 
for instance, 


H+2={h+2|heR} = {3z +2|2E2Z} = [2]. 
Since there are exactly three distinct congruence classes modulo 3 (cosets of H), 
we have [Z:H] = 3. 


EXAMPLE 4 


Under addition the group Z of integers is a subgroup of the group Q of ratio- 
nal numbers. By the definition of congruence and Theorem 8.2, 


Z+a=Z+c “‘wfandonlyif a-—cedZ. 


Consequently, if 0 < e < a < 1, then Z + aand Z + c are distinct cosets because 
0 <a -— c< 1, which means that a — c cannot be in Z. Since there are infinitely 


many rationals between 0 and 1, there are an infinite number of distinct cosets 
of Z in Q. Hence, [Q:Z] is infinite. 


Theorem 8.5 Lagrange’s Theorem 


If K is a subgroup of a finite group G, then the order of K divides the order of 
G. In particular, |G| = [K| [G:K} 


Proof It is convenient to adopt the following notation. If A is a finite set, then |A| 
denotes the number of elements in A. Observe that if A and B are disjoint 
finite sets, then |4 U B| = |A| + |B]. Now suppose that [G:K] = nand 
denote the n distinct cosets of K in G by Kc,, Kes,..., Kc,. By 
Theorem 8.4 

G = Ke, U Kc U + + U Key 


Since these cosets are all distinct, they are mutually disjoint by Corollary 8.3. 
Consequently, 
IG] = |Ke] + |Ke,] + +++ + |Kegl. 
For each c, however, |Kc| = |K| by Theorem 8.4. Therefore, 
IG] = |K| + |X| + +--+ |IK| =|Kln = |K|[G:K]. m 
n summands 


Lagrange’s Theorem shows that there are a limited number of possibilities for the 
subgroups of a finite group. For instance, a subgroup of a group of order 12 must 
have one of these orders: 1, 2, 3, 4, 6, or 12 (the only divisors of 12). Be careful, 
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however, for these are only the possible orders of subgroups. Lagrange’s Theorem does 
not say that a group G must have a subgroup of order k for every k that divides |G]. 
For instance, the alternating group 44 has order 12 but has no subgroup of order 6 
(Exercise 44). Lagrange’s Theorem also puts limitations on the possible orders of 
elements in a group: 


Corollary 8.6 


Let G be a finite group. 


(1) If aeG, then the order of a divides the order of G. 
(2) If |G] = k, then a = e for every aE G. 


Proof» (1) If a e G has order n, then the cyclic subgroup (a) of G has order n 
by Theorem 7.15. Consequently, n divides |G| by Lagrange’s Theorem. 
(2) If a€ G has order n, then n| k by part (1), say k = nt. Therefore, 

dț =a" = (d= =e. M 


The Structure of Finite Groups 


A major goal of group theory is the classification of all finite groups up to isomor- 
phism; that is, we would like to produce a list of groups such that every finite group is 
isomorphic to exactly one group on the list. This is a problem of immense difficulty, 
but a number of partial results have already been obtained. Theorem 7.19, for exam- 
ple, provides a classification of all cyclic groups; it says, in effect, that every nontrivial 
finite cyclic group is isomorphic to exactly one group on this list: Z2, Z3, Z4, .... All 
finite abelian groups will be classified in Section 9.2. 

We now use Lagrange’s Theorem and its corollary to classify all groups of prime 
order and all groups of order less than 8. In the proofs below enough of the necessary 
calculations are included to show you how the argument goes, but you should take 
pencil and paper and supply all the missing computations. 


Theorem 8.7 


Let p be a positive prime integer. Every group of order p is cyclic and isomor- 
phic to Zp. 


Proof » 1f Gisa group of order p and a is any nonidentity element of G, then 
the cyclic subgroup (a) is a group of order greater than 1. Since the 
order of the group (a) must divide p and since p is prime, (a) must be a 
group of order p. Thus (a) is all of G, and G is a cyclic group of order p. 
Therefore, G = Z, by Theorem 7.19. W 
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Theorem 8.8 


Every group of order 4 is isomorphic to either Z, or Z X Zp. 


Proof > Let G be a group of order 4. Either G contains an element of order 4 or 
it does not. If it does, then the cyclic subgroup generated by this element 
has order 4 by Theorem 7.15 and, hence, must be all of G. Therefore, G 
is a cyclic group of order 4, and G = Z, by Theorem 7.19. 

Now suppose that G does not contain an element of order 4. Let e, a, 

b, c be the distinct elements of G, with e the identity element. Since every 
element of G must have order dividing 4 by Corollary 8.6 and since e is 
the only element of order 1, each of a, b, c must have order 2. Thus the 
operation table of G must look like this: 


e 
e 
a 
b 
c 


aà SA & 


In order to fill in the missing entries, we first consider the product ab. If 
ab = e, then ab = aa and, hence, a = b by cancelation. This is a contra- 
diction, and so ab # e, If ab = a, then ab = ae and b = e by cancelation, 
another contradiction. Similarly, ab = b implies the contradiction a = e. 
Therefore, the only possibility is ab = c. Similar arguments show that 
there is only one possible operation table for G, namely, 


o F FS 

>o SA 01H 
eS ao ® AJA 
aga E T Oo 
eo a Salsa 


Let f: G —> Z, X Z, be given by fle) = (0, 0), a) = (1, 0), Rb) = (0, 1), 
and f(c) = (1, 1). Show that fis an isomorphism by comparing the 
operation tables of the two groups. E 


Theorem 8.9 


Every group G of order 6 is isomorphic to either Z; or S; 


Proof» If Gcontains an element of order 6, then Gis a cyclic group of order 6 
and, hence, is isomorphic to Zs by Theorem 7.19. So suppose G contains 
no element of order 6. Then every nonidentity element of G has order 
2 or 3 by Corollary 8.6. If every nonidentity element of G has order 2, 
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then Gis an abelian group by Exercise 27 of Section 7.2. If c and dare 
nonidentity elements of G, then the set H = {e, c, d, cd} is closed under 
multiplication (because ¢ = e = Æ and cd = dc). Hence, H is a subgroup 
of G by Theorem 7.12. This is a contradiction since no group of order 6 
can have a subgroup of order 4 by Lagrange’s Theorem. Therefore, the 
nonidentity elements of Gcannot all have order 2, and G must contain 
an element a of order 3. Let N be the cyclic subgroup (a) = {e, a, a°} and 
let b be any element of G that is not in N. The cosets Ne = {e, a, a°} and 

= {b, ab, ab} are not identical since bÆ N = Ne and, hence, must be 
disjoint (Corollary 8.3). Therefore, G consists of the six elements e, a, a’, 
b, ab, a*b. 

We now show that there is only one possible operation table for G. 
What are the possibilities for 47? We claim that A cannot be any of a, a’, 
b, ab, or a’b. For instance, if b? = a, then bt = a?. However, b either has 
order 2 (in paih cased = bt = PP = =ee=e,a contradiction) or order 3 
(in which case a? = $ = P’b = eb = b, another contradiction since b ¢ N). 
Similar arguments show that the only possibility is } = e. 
Next we determine the product ba. It is easy to see that ba cannot 

be any of b, e, a, or a” (for instance, ba = a implies b = e). So the only 
possibilities are ba = ab or ba = a’b. If ba = ab, then verify that ba has 
order 6 by computing its powers. This contradicts our assumption that 
G has no element of order 6. Therefore, we must have ba = ab. Using 
these two facts: 


B=e and ba = ab, 
we can now compute every product in G. For example, ba’ = (ba)a = 


(a’b)a = a*(ba) = a’a’b = a’b = ab. 
Verify that the operation table for G must look like this: 


e a a b ab a’b 
e e a a b ab a&b 
a a a e ab ab b 
e | e a ab b ab 
b b ab ab e a 
ab | ab b ab a e a 
ab| ab ab b ef a e 


By comparing tables, show that G is isomorphic to S; under the 


correspondence 
e a a b ab ab 
4 4 4 4 4 4 
1 2 3NI 2 ") t 2 3 $ 2 d I2 3 ie 2» B 
1 2 3/\2 3 U 1 2 1 J 2 IAI 3 y 
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The last three theorems provide a complete classification of all groups of order less 
than 8, as summarized in this table: 


If Ghas order l then G is isomorphic to 
3 Z, 
4 £4 0r Z2 X Za 
5 Zs 
6 Zs or S3 
7 Z 


The classification of groups is discussed further in Chapter 9, particularly in Section 9.5 
where the preceding chart is extended to order 15. 


E Exercises 


A. 1. Let K bea subgroup of a group Gand let aE G. Prove that Ka = K if and only 
if aek. 


In Exercises 2-6, G is a group and K is a subgroup of G. List the distinct right cosets of K in G. 


2. K= {r} v}: G = D, [Fhe operation table for D, is in Example 5 of Section 7.1 
or 7.1.A.] 


3. K= {ro r ra r3}; G = Dy. 


4 K= {get (32) he= S 
5. K= {1, 17}; G = Uy. 
6. K = (3); G = Uy. 
In Exercises 7-11, G is a group and H is a subgroup of G. Find the index [G: H]. 
7. H = {ro, ra}; G = Dy. 
8. H = (3); G = Zp. 
9. H = (3); G= Zy 
10. H is the subgroup generated by 12 and 20; G = Zy. 
1234 
234 iy poak 


12.* (a) Let K = {(1), (12)(34), (13)(24), (14)(23)}. Show that K is a subgroup of 
Ay and hence, a subgroup of S4. [Hint: Theorem 7.12.] 


(b) State the number of cosets of Kin 44. Don’t list them. 
(c) State the number of cosets of K in Sy. Don’t list them. 


11. H is the cyclic subgroup generated by ( 


*Skip this exercise if you haven't read Section 7.5. 
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In Exercises 13-15, K is a subgroup of G. Determine whether the given cosets are 
disjoint or identical. 


13. G= 2; K =(7) 

(a) K+4and K +3 (b) K =4 and K + 137 (c) K + (—4) and K + 59 
14.* G = Sy K is the subgroup of Exercise 12. 

(a) K(12) and K(34) (b) K(1234) and K(1324) 
15. G = Uy; K = (9) 

(a) K17 and K19 (b) K9 and K25 


16. Suppose G is the cyclic group (a) and |a] = 15. If K = (a), list all the distinct 
cosets of Kin G. 


17. What are the possible orders of the subgroups of Gwhen G is 
(a) Zo  (b)S4 (c) DaX Zio 

18. Give examples, other than those in the text, of infinite groups G and H such that 
(a) [G-H] is finite (b) [G:H] is infinite 


19. Let G be a finite group that has elements of every order from 1 through 12. 
What is the smallest possible value of |G|? 


20. A group G has fewer than 100 elements and subgroups of orders 10 and 25. 
What is the order of G? 


21. Let H and K, each of prime order p, be subgroups of a group G. If H # K, 
prove that HN K = {e}. 


22. If H and Kare subgroups of a finite group G, prove that |H N K| is a common 
divisor of |H| and |X]. 


B. 23. If Gis a group with more than one element and G has no proper subgroups, 
prove that Gis isomorphic to Z, for some prime p. 


24. If Gis a group of order 25, prove that either G is cyclic or else every 
nonidentity element of G has order 5. 


25. Let a be an element of order 30 in a group G. What is the index of (a°) in the 
group (a)? 

26. Prove that a group of order 8 must contain an element of order 2. 

27. If n > 2, prove that n — 1 is an element of order 2 in U,,. 

28. If n > 2, prove that the order of the group U, is even. 


29. Let H and K be subgroups of a finite group G such that K G H, [G:H] is finite, 
and [H:K] is finite. Prove that [G:K] = [G:H][H:K]. [Hint: Lagrange.] 


30. Let H and K be subgroups of an infinite group G such that K C H, [G:H] is 
finite, and [H:K] is finite. Prove that [G:K] is finite and [G:K] = [G:H][H:K]. 
[Hint: Let Ha,, Haz, ..., Ha, be the distinct cosets of H in Gand let Kh, 
Kb,, . . . , Kb, be the distinct cosets of K in H. Show that Kha, (with | s i = m 
and 1 = j = n) are the distinct cosets of Kin G.] 


*Skip this exercise if you haven't read Section 7.5. 
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31. If Gis a group of even order, prove that G contains an element of order 2. 


32. If Gis an abelian group of order 2, with n odd, prove that G contains exactly 
one element of order 2. 


33. (a) If a and b each have order 3 in a group and a” = b?, prove that a = b. 
[Hint- What are a~! and b=?) 


(b) If Gisa finite group, prove that there is an even number of elements of 
order 3 in G. 


34. Let G be an abelian group of odd order. If a,, a), a3, ..., a, are the distinct 
elements of G (one of which is the identity e), prove that a,a7a; +--+ a, = e. 


35. If p and q are primes, show that every proper subgroup of a group of order pg 
is cyclic. 

36. Let H and K be subgroups of a finite group G such that [G:H] = p and[G:K] = q, 
with p and q distinct primes. Prove that pg divides [G:H A K]. 


37. Let G be an abelian group of order n and let k bea positive integer. If (k, n) = 1, 
prove that the function f:G — G given by f(a) = a@* is an isomorphism. 


38. If Gis a group of order n and G has 2" ~ ' subgroups, prove that G = (e) or 
G= Ža. 
C. 39. Let G be a nonabelian group of order 10. 


(a) Prove that G contains an element of order 5. [Hint: Exercise 27 of 
Section 7.2.) 


(b) Prove that G contains five elements of order 2. [Hint: Use techniques 
similar to those in the proof of Theorem 8.9.] 


40. If a prime p divides the order of a finite group G, prove that the number of 
elements of order p in G is a multiple of p — 1. 


41. Prove that a group of order 33 contains an element of order 3. 


42. Let G be a group generated by elements a and b such that faj = 4, |b| = 2, and 
ba = @b. Show that G is a group of order 8 and that G is isomorphic to D4. 


43. Let G be a group generated by elements a and b such that |a| = 4, b? = a’, and 
ba = œb. Show that G is a group of order 8 and that Gis isomorphic to the 
quaternion group of Exercise 16 in Section 7.1. 


44.* (a) Show that 4, (which has order 12 by Theorem 7.29) has exactly three 
elements of order 2. 


(b) Prove that the elements of order 2 and the identity element form a 
subgroup. 


(c) Prove that A, has no subgroup of order 6. Hence, the converse of 
Lagrange’s Theorem is false. [Hint: If N is a subgroup of order 6, use 
Theorem 8.9 to determine the structure of N and use part (b) to reach a 
contradiction.] 


*Skip this exercise if you haven't read Section 7.5. 
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82 Normal Subgroups 


Suppose G is a group and Ķ is a subgroup. Our goal in this section and the next is 
to create a new group (if possible), whose elements are the right cosets of K (that is, 
congruence classes mod K)—much as we created Z„, whose elements are congruence 
classes of integers. 

Recall that the definition of addition of congruence classes of integers in Chapter 2 
depended on part (1) of Theorem 2.2, which states 


If a = b (mod n) and c = d (mod n), then a + c= b + d (mod n).* 


If K is a subgroup of a multiplicative group G, then the translation of this statement 
to congruence mod K is 


(*) If a = b (mod K) and ¢ = d (mod K), then ac = bd (mod K). 


Unfortunately, however, statement (+) is false for some subgroups. (see Exercise 2 for 
an example). Nevertheless, there is a class of subgroups for which statement (+) is true. 
We shall identify these “special” subgroups in this section and define multiplication of 
their right cosets in Section 8.3.‘ 

Recall that if K is a subgroup of G, then the right coset Ka is the set Ka = 
{ka|k € K}. Similarly, the left coset aK is defined to be the set 


aK = {ak|keK}. 
EXAMPLE 1 
Let K be the subgroup {79, v} of D4, whose operation table is shown below. The 


right coset Kd is the set {r° d, vo d} = {d, r3} and the left coset dK is the set 
{d° ro, d° v) = {d, r}. So Kd # dK. 


Da v 
s 

v d 

d h 

r, r3 Fo ri ry v d h t 

d d v t h ro rs r ri 

h h d v t ri To r3 ra 

t t h d v r rı ro r3 

v v t h d r3 ra ri ro 


*We don't deal with integer multiplication here because the integers form a group under addition, but 
not under multiplication. Similarly in Chapter 6, when developing the basic facts about congruence 
and cosets in rings, we dealt only with the additive group of a ring and ignored its multiplication. 
tEssentially the same thing was done in Chapter 6 when we needed to prove Theorem 6.5 (the 
analogue of Theorem 2.2 for rings)—the discussion did not apply to every subring, but only to 
ideals, each of which is a special kind of subring. 


(Copyright 2012 Omega Lemming AN Rights Ramryet Mag act ba copied, eran, of dapticamd, in whole or ia pert Dus to electronic cights, anne third party contest may Se mppamai frun the eBook eatur eChagher(s). Edimrial eniow bas 
Gad hat any cqped costed dows act mammilty fect the crani larning apeiare Cagage | cerning @asrves the rigit to cecmve atkficional coatesi at 629 ime if abamu igin cemrictirms require it. 


8.2 Normal Subgroups 249 


EXAMPLE 2 


Let N be the subgroup {ro F1, r2, r3} Of D4. Then the right coset Nv is the set 
Nu = {r° v, r) 0v, r20 v, 730 v} = {v, d, h, B 

and the left coset vN is the same set: 
UN = {voro VOT}, vory vor} = fv, t, h, d}. 


So in this case, Nv = vN.* Similar calculations (Exercise 3) show that every right 
coset of N is also a left coset, that is, 

Nro = oN, Nri=riN, Nri= rN, Nry= rN, 

Nd=dN, Nh=hN, Nt=tN, Nv=N. 


Subgroups with this property have a special name. 


Definition A subgroup N of a group G is said to be normal if Na = aN for everyae G. 


EXAMPLE 3 


N = {ro, r{, ra, r3} is a normal subgroup of D4, but K = {ro, v} is not, as shown 
in Examples | and 2. 


EXAMPLE 4 


If Nis a subgroup of an abelian group G and a E G, then na = an for every 
n EN, so that the right coset Na is the same as the left coset aN. Hence, 


Every subgroup of an abelian group is normal. 


EXAMPLE 5 


Let M be the subgroup {ro, r2} of D4. Then the operation table for D4 in 
Example | shows that ro ° a = a o ro and r20 a = aor, for every a E D,. So it is 
certainly true that Ma = aM for every a E D,. Hence, M is a normal subgroup 
of Ds. 


In Example 5, the subgroup M is the center of D, (see Example 10 of Section 7.3). 
So the center of D4 is a normal subgroup. The same thing is true in general. 


“Remember that the elements of a set may be listed in any order. 
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EXAMPLE 6 


The center Z(G) of a group G is the subgroup 
Z(G) = {cE G| cg = ge for every g E G} 


(Theorem 7.13). Since ca = ac for every ce Z(G) and ae G, we see that 
Z(G)a = aZ(G) for every a E G. Hence, Z(G) is a normal subgroup of G. 


Other examples of normal subgroups appear in Exercises 3-5, 7-9, 14, and 23. 
Examples 4—6, though important, are misleading in that the elements of the normal 
subgroup N commute with all the other elements of the group in each case. In the gen- 
eral case, however, this is not necessarily true. When Nis a normal subgroup of G, then, 


The condition Na = aN does not imply that na = an for every n E N. 


EXAMPLE 7 


As we saw in the Example 2, N = {ro, r, ro, r3} is anormal subgroup of D4. In 
particular, Nv = 4N. However, v does not commute with all the elements of N. 
For instance, r, o v E Nv and vo r} E vN, but the operation table for D, shows that 


rov=t and vor,=d, SO fz30 V $É vory, 


even though Nv = vN. 


Thus, if N is a normal subgroup of G, the elements of N may not commute 
with every element of G. Nevertheless, you can think of the normal subgroup N 
as providing a weak version of commutativity in the following sense. 


If n E N, and a & G, then for some n, n, E N, 


na=an, and an= ny, 
because na E Na and Na = aN and similarly, an € aN and aN = Na. 
EXAMPLE 8 
Once again, consider the normal subgroup N = {ro, r1, 2, r3) of D4. The 
operation table for D4 shows that r, o v = t and vor, = t. Hence, 
fov Svor. 


This is the first part of the preceding boldface statement, with n = r}, @ = v, 
and n =r. 


Our goal at the beginning of this section was to find a class of subgroups for which 
statement (+) on page 248 (the group theory analogue of Theorem 2.2) is true. Normal 
subgroups are exactly what’s needed. 
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Theorem 8.10 


Let N be a normal subgroup of a group G. 
If a = b (mod N) and c = d (mod N), then ac = bd (mod N}. 


The proof is essentially a translation into multiplicative notation of the proof 
of part (1) of Theorem 2.2, with commutativity of integers replaced by the weak 
commutativity in G provided by the normal subgroup N. 


Proof of Theorem 8.10 > By the definition of congruence, there are elements 
m, nE K such that ab“! = mand cd ' = n. Then 
(ac)(bd)“' = acd-"h- [Corollary 7.6) 
= anb! [Because cd = n] 


Now an € aN and aN = Na by normality, so an = nya for some n, E N. Hence, 
(acbd)! = anb™' 
= nab" 
=nm [Because ab! = meN] 
Therefore, (ae)(bd)' = nm EN, and ac = bd (mod N). E 


We close this section with a theorem that provides alternate descriptions of nor- 
mality. Verifying condition (2) or (3) in the theorem is often the easiest way to prove 
that a given subgroup is normal. 


Theorem 8.11 


The following conditions on a subgroup N of a group G are equivalent: 


(1) N is a normal subgroup of G. 

(2) a~'Na & N for every ae G, where a~'Na = {a™na|ne N}. 
(3) aNa~'< N for every ae G, where aNa™' = {ana jn EN). 
(4) a-'Na = N for every ae G. 

(5) aNa7' = N for everya eG. 


Note that in (4), a-!Na = N does not mean that a~'wa = n for each nEN; 
all it means is that a~'na = n; for some n; E N. Analogous remarks apply to (2), 


(3), and (5). 


Proof of Theorem all > (1) => (2) Suppose nE N and a-'na ca Na. We must 
show that ana € N. Note that na is an element of the right coset Na. 
Since Ni is normal by (1), Na = aN. Hence, na = an, for some n, EN. 
Thus ana = a” 'an, = en, = n EN. Therefore, a 'Na S N. 
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(2) + (3) If (2) holds for every element of G, then it holds with a7 in 
place of a, that is, 


(++) (a) "NaN. 


But (a~})“! = a, so that (++) is statement (3): aNa™! & N. Similarly, if 
(3) holds for every element of G, then it holds with a~! in place of a, 
which implies statement (2). 

(3) => (4) Since (3) implies (2), we have a~'Na & N. To prove 
N Sa™'Na, suppose n E N. Then n = a~'(ana™ Ja. By (3) ana! = m for 
some mE N. Thus n = a~'!n.a€a'Na, which proves that N&a-'Na. 
Therefore, a'Na = N. 

(4) = (5) If (4) holds for every element of G, then it holds with a in 
place of a, that is, 


N = (ay t Na = aNa"', 


Similarly, if (5) holds for every element of G, then it holds with a“ in 
place of a, which implies statement (4). 


(5) = (1) Suppose n € N and anc aN. Then ana! CaNa! = N by (5), 
so that ana~' = n, for some n, € N. Multiplying this last equation on the 
tight by a shows that an = ma E Na. Therefore, aN & Na. Conversely, if 
na E Na, then a'nac a~'Na = N because (5) implies (4). Hence, a~'na = 
n, for some n, E N. Multiplying on the left by a shows that na = an, E aN. 
Thus Na & aN. Therefore, Na = aN for every a € G and Nis a normal 
subgroupof G. E 


EXAMPLE 9 


F 12 3\/1 2 3\/1 BN V- 
Verify that A = (i > GC 3 ple 1 >) }isasubgroupof Sy You 


could show that 4 is a normal subgroup by calculating the right and left cosets, 
but that is cumbersome and time consuming. It’s easier to proceed as follows. If 
c E S}, then by Exercise 20 of Section 7.4, c7! Ac is a subgroup of order 3. But 
A is the only subgroup of order 3 in S} (all the other nonidentity elements of 

S, have order 2, and hence, cannot be in a group of order 3 by Corollary 8.6). 
Therefore, we must have e7! Ac = A. Thus, A is a normal subgroup by part (5) 
of Theorem 8.11. 


E Exercises 


A. 1. Let K bea subgroup of a group Gand let a E G. Prove that aK = K if and only 
if ae K. 


2. Let K be the subgroup {ro, v} of D} Show that r, = t (mod K) andr, = h 
(mod K), but r,° r2 £ t° h (mod K). 


3. Prove that N = {rg, ři, r'a r3} is a normal subgroup of D, by listing all its right 
and left cosets. 
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4. If Gis a group, show that (e) and G are normal subgroups. 


b 
5. (a) Prove that G = (h a a, b, dER and ad + o} is a group under 
matrix multiplication and that N = l i be R} is a subgroup of G. 


(b) Use Theorem 8.11 to show that N is normal in G. 


3\ (1 
6. Provethat C i A ( 4 sy} is a subgroup of S, but not normal. 


123 
7. Let Gand H be groups. Prove that G* = {(a, e) |a E G} isa normal subgroup 
of GX H. 
8. (a) List all the cyclic subgroups of the quaternion group (Exercise 16 of 
Section 7.1). 


(b) Show that each of the subgroups in part (a) is normal. 
9. Let N be a subgroup of a group G. Suppose that, for each aE G, there exists 
be Gsuch that Na = bN. Prove that N is a normal subgroup. 
10. If Gis a group, prove that every subgroup of Z(G) is normal in G. [Compare 
with Exercise 14.] 


11. A subgroup N of a group Gis said to be characteristic if AN) & N for every 
automorphism fof G. Prove that every characteristic subgroup is normal. 
(The converse is false, but this is harder to prove.) 


12. Prove that for any group G, the center Z(G) is a characteristic subgroup. 


13. Let N be a subgroup of a group G. Prove that N is normal if and only if 
SW) = N for every inner automorphism f of G. 

14. Show by example that if M is anormal subgroup of Nand if N is anormal 
subgroup of a group G, then M need not be a norma! subgroup of G; in 
other words, normality isn’t transitive. [Hint: Consider M = {v, ro} and 
N = fh, v, ro, ro} in Da] 

15.* Prove that A, is a normal subgroup of S,. [Hint: If o E S, and T E Ap is 

o`! ta even or odd? See Example 7 of Section 7.5.] 


B. 16. If Kis a normal subgroup of order 2 in a group G, prove that K & Z(K). 
[Hint: If K = {e, k} and a E G, what are the possibilities for aka~'7] 
17. Let f:G —> H be a homomorphism of groups and let K = {ae G| f(a) = ep}. 
Prove that K isa normal subgroup of G. 


18. If K and N are normal subgroups of a group G, prove that K N Nis a normal 
subgroup of G. 


19. Let N and K be subgroups of a group G.If Nis normal in G, prove that N N K 
is a normal subgroup of K. 


20. (a) Let N and K be subgroups of a group G. If N is normal in G, prove that NK = 
{nk |neEN, ke K, is a subgroup of G. [Compare Exercise 26(b) of Section 7.3.] 


(b) If both Nand K are normal subgroups of G, prove that NK is normal. 


*Skip this exercise if you haven't read Section 7.5. 
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21. 


22. 


23. 


24. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34, 


Normal Subgroups and Quotient Groups 


If K and N are normal subgroups of a group G such that K N N = (e), prove 
that nk = kn for every nE N, KEK. 


If f:G— H is a surjective homomorphism of groups and if N is a normal 
subgroup of G, prove that /{N) is a normal subgroup of H. 


Let N be a subgroup of a group G of index 2. Prove that N is a normal 
subgroup as follows. 


(a) If a ¢ N, prove that the coset Na consists of all elements of G that are 
notin N. 


(b) For each aE G, prove that a~!Na € N and apply Theorem 8.11. [Hint: If 
a ¢ NandnEN, a nais either in N or in Naby part (a). Show that the 
latter possibility leads to a contradiction.] 


Let N = {AE GL(2, R)| det A EQ}. Prove that N is a normal subgroup of 
GL(2, R). [Hint: Exercise 32 of Section 7.4.] 


. Prove that SL(2, R) is a normal subgroup of GL(2, R). [Hint: SL(2, R) is 


defined in Exercise 23 of Section 7.1 Use Exercise 17 above and Exercise 32 of 
Section 7.4.] 


Let H be a subgroup of order n in a group G. If H is the only subgroup of 
order n, prove that H is normal. [Hint: Theorem 8.11 and Exercise 20 in 
Section 7.4,] 


Prove that a subgroup N of a group G is normal if and only if it has this 
property: ab € N if and only if ba EN, for all a, b E G. 


Prove that the cyclic subgroup (a) of a group Gis normal if and only if for 
each g € G, ga = atg for some keZ. 

Let N be a cyclic normal subgroup of a group G, and H any subgroup of N. 
Prove that H isa normal subgroup of G. [Compare Exercise 14.] 

Let A and B be normal subgroups of a group G such that A N B = (e) and 
AB = G (see Exercise 20). Prove that A X B = G. [Hint: Define fA X B > G 
by f(a, b) = ab and use Exercise 21.] 


Let H be a subgroup of a group Gand let N(H) be its normalizer (see 
Exercise 39 in Section 7.3). Prove that 


(a) H is a normal subgroup of M(H). 
(b) If H isa normal subgroup of a subgroup K of G, then KS N(H). 
Prove that Inn Gis a normal subgroup of Aut G. [See Exercise 37 of Section 7.4.] 


Let T be a set with three or more elements and let A(T) be the group of all 
permutations of T. If ae T, let H, = {fe A(T)| f(a) = a}. Prove that H, isa 
subgroup of A(7) that is not normal. 


Let G be a group that contains at least one subgroup of order n. Let N = NK, 
where the intersection is taken over all subgroups K of order n. Prove that N is 
anormal subgroup of G. [Hint: For each a€ G, verify that a~'Na = Na™'Ka, 
where the intersection is over all subgroups K of order n; use Exercise 20 of 
Section 7.4.] 
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35. Let H be a subgroup of a group G and let N = (_)a™'Ha. Prove that Nisa 
normal subgroup of G. ee 


36. If Mis a characteristic subgroup of N and N is a normal subgroup of a group 
G, prove that M is a normal subgroup of G. [See Exercise 11.] 


37. Let G be a group all of whose subgroups are normal. If a, b € G, prove that 
there is an integer k such that ab = bæ. 


83 Quotient Groups 


Let N be a normal subgroup of a group G. Then 
GIN denotes the set of all right cosets of N in G. 


Our first goal is to define an operation on right cosets so that G/N becomes a group. 
Since right cosets are congruence classes, our experience with Z and other rings suggests 
that it would be reasonable to define such an operation as follows: The product of the 
coset Na (the congruence class of a) and the coset Nb (the congruence class of b) is the 
coset Nab (the congruence class of ab). In symbols, this definition reads 


(Na)(Nb) = Nab. 


As in the past, we must verify that the definition does not depend on the elements 
chosen to represent the various cosets, and so we must prove 


Theorem 8.12 


Let N be a normal subgroup of a group G. If Na = Nc and Nb = Nd in G/N, 
then Nab = Ned. 


Proof» Na = Ne implies that a = c (mod N) by Theorem 8.2, similarly, Nb = Nd 
implies that b = d(mod N). Therefore, ab = cd(mod N) by Theorem 8.10. 
Hence, Nab = Nedby Theorem 8.2. E 


Theorem 8.13 


Let N be a normal subgroup of a group G. Then 


(1) G/N is a group under the operation defined by (Na)(Ne) = Nac. 
(2) If G is finite, then the order of G/N is |G|/V} 
(3) If G is an abelian group, then so is G/N. 


The group G/N is called the quotient group or factor group of G by N. 
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Proof of Theorem 8.13 » (1) The operation in G/N is well defined by Theorem 8.12. 
The coset N = Ne is the identity element in G/N since (Na)(Ne) = 
Nae = Na and (Ne)(Na) = Nea = Na for every Na in G/N. The inverse 
of Na is the coset Na~! since (Na)(Na~') = Naa“ = Ne and, similarly, 
(Na™!)(Na) = Ne. Associativity in G/N follows from that in G: 


[(Na)(Nb)\(Ne) = (Nab)(Nc) = N(ab)c = Na{bc) = (Na)(Nbc) 
= (Na)[(Nb)(No)]. 
Therefore, G/N is a group. 


(2) The order of G/N is the number of distinct right cosets of N, that is, 
the index [G:N]. By Lagrange’s Theorem, [G:N] = |G|/|M. 
(3) Exercise 11. m 


EXAMPLE 1 
In Example 2 of Section 8.2 we saw that N = (ro, ři, 72, r3} is a normal sub- 
group of D,. The operation table for D,in Example 1 of Section 8.2 shows that 
Nro = {ro ° Fos F1 ° Fo, 12° Fo, 73° Fo} = {Fo Fi, F2, Fs} 
Nv = {r90 v, rie v, r20 v, r30 v} = {v, d, h, t}. 


Since every element of D,is in either Nro or Nv and since any two cosets of N are 
either disjoint or identical (Corollary 8.3), every coset of N must be equal to Nro 
or Nv. In other words, D,/N = {Nro, Nv}. Since ro ° v = v = vo ro and v ° v = ro, 
the operation table for the quotient group D4/N is 


By Theorem 8.7, D,/N is isomorphic to the additive group #. 
EXAMPLE 2 


In Example 5 of Section 8.2 we saw that M = {ro, r2} is a normal subgroup of D,. 
Using the operation table for D,, we find that D,/M consists of these four cosets: 


Mr = fro r} = Mn Mr, = $ru r} = Mr, 
Mh = fh, v} = Mv Md = {d, t} = Mt. 


We shall choose one way of representing each coset and list the elements of D,/M 
as Mro, Mr;, Mh, and Md. When we compute products in D4/M, we express the 
answers in terms of these four cosets. For instance, since d ° r} = vin D,, we have 
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(Md)(Mr\) = M(d°r,) = Mv; but Mv = Mh, so we write (Md)(Mr,) = Mh in the 
table below. You should fill in the missing entries: 


| Mo Mr, Mh Md 
Mr, Mro Mr, Mh Md 
Mr, Mr, Mro Md 
Mh | Mh Md Mr, 

Ma Mad Mh 


The completed tabel shows that D,/M is an abelian group in which every nonidentity 
element has order 2 (Exercise 3). So D,/M is not cyclic. Hence, D,/M is isomorphic 
to Z, X Z, by Theorem 8.8. 


Examples 3-7 deal with abelian groups. So every subgroup is normal. 


EXAMPLE 3 


In the additive group Zn, let N be the cyclic group (4) = 40, 4, 8}. These four 
cosets of N contain every element of Zz: 

N+0= {0,4,8 =N 

N+1= {1,5,9} 

N +2 = {2, 6, 10} 

N +3 = {3,7,11}. 
Hence, every coset is one of these four. For instance, 5 isin N + 1 and 5 is also 


in N + 5 (Why?). So the two cosets are not disjoint. Hence, N + 1 = N+ 5 by 
Corollary 8.3. Similarly, 


N+4=N+0 and N+6=N+2. 
Using these facts, we see that the addition table for Zı2/N is 


N+0 N+1 N+2 Nt+3 


N+0 N+0 N+1 N+2 Nt+3 
N+1 N+1 N+2 N+3 N+t+0 
N+2 N+2 N+3 N+0 N+! 
N+3 N+3 N+0 NEI NEZ 


Verify that N + 1 has order 4. So Z,./N is a cyclic group of order 4 and hence, is 
isomorphic to Z4 by Theorem 7.19. 
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EXAMPLE 4 


Let N be the cyclic subgroup ((1, 2)) of the additive group G = Z, X Z4. Since 
(1, 2) + (1, 2) = (0, 0), we see that N = {(0, 0), (1, 2)}. Consequently, G/N con- 
sists of these four cosets 


N + (0, 0) = {(0, 0), (1, 2)} = N + (1, 2) 
N + (1, 0) = {(1, 0), (0, 2)} = N+ (0, 2) 
N + (0, 1) = {(0, 1), (1, 3)} = N + (1, 3) 
N+ (1, 1) = {(1, 1), (0, 3)} = N + (0, 3) 


and has the following addition table: 


N + (0, 0) N + (1, 0) N + (0, 1) N+ (1, 1) 


N+(0,0) | N+(0,0) N+(1,0) N+(0,1) N+(1,1) 
N+(1,0) | N+(1,0) N+(0,0) N+(1,1) N+(0,1) 
N+(0,1) | N+(,1) N+(1,1) N+(1,0) N+(0,0) 
N+(1,1) | N+,  N+0,1) N+(0,0) N+(1,0) 


Use the table to verify that G/N is a cyclic group of order 4 generated by N + (0, 1). 
Therefore, G/N = Z, by Theorem 7.19. 


It is not always necessary (or even possible) to write out the operation table for a 
quotient group G/N in order to determine its structure, as was done in Examples 1—4. 


EXAMPLE 5 
By Theorem 2.10, the group U4 = {1, 3, 5, 9, 11, 13} and thus has order 6. Let 
U; 
M be the cyclic subgroup (13) = {1, 13}. Then |U; / M| = hal = £ = 3 by 


Theorem 8.13. Therefore, U,4/M is isomorphic to Z, by Theorem 8.7. 


EXAMPLE 6 


In the additive group Z, let K be the cyclic subgroup 
(4) ={0, +4, +8, +12,...}. 
As we saw in Example 1 of Section 8.1, a = b (mod 4) means a ~ be K. Hence, 


a = b (mod 4) if and only if a = b (mod K). 
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So the set of integers that are congruent to a modulo 4 (the congruence class 
[a]) is exactly the same as the set of integers that are congruent to a modulo K 
(the coset K + a). In other words, [a] = K + a. Arithmetic is the same in either 
notation: 


Ka + Kb = K(a + b) isthe same as [a] + [b] = [a+ b]. 


Therefore, Z/Kis the group of congruence classes modulo 4, that is, Z/K = Z,. The 
same argument works with any positive integer n in place of 4: 


If K is the cyclic subgroup (a) of Z, then Z/K = Z, 


EXAMPLE 7 


The subgroup Z of integers in the additive group Q of rational numbers is 
normal since Q is abelian. Example 4 of Section 8.1 shows that there are infi- 
nitely many distinct cosets of Z in Q. Consequently, the quotient group Q/Z is 
an infinite abelian group. Nevertheless, every element of Q/Z has finite order 
(Exercise 25). 


The Structure of Groups 


If N is a normal subgroup of a group G, then the structure of each of the groups N, 
G, and G/N is related to the structure of the others. If we know enough information 
about two of these groups, we can often determine useful information about the third, 
as illustrated in the following theorems. 


Theorem 8.14 


Let N be a normal subgroup of a group G. Then G/N is abelian if and only if 
abab” EN for all a, bEG. 


Proof» G/N is abelian if and only if 
Nab = NaNb = NbNa = Nba foralla,beG 
But Nab = Nbaif and only if (ab)(ba)“'! € N by Theorem 8.21; and 


(ab)(ba)"! = aba~'b—' by Corollary 7.6. Therefore, G/N is abelian if and 
only if aba `’b ~'e N foralla,beG. E 


If G is a group, Example 6 of Section 8.2 shows that its center Z(G) is a normal 
subgroup of G. 
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Theorem 8.15 


If G is a group such thatthe quotient group G/Z(G) is cyclic, then G is abelian. 


Proof» For notational convenience, denote Z(G) by C. Since G/C is cyclic, it 
has a generator Cd, and every coset in G/C is of the form (Cd)* = Ca* 
for some integer k. Let a and b be any elements of G. Since a = ea is in 
the coset Ca and since Ca = Cd’ for some i, we have a = cd’ for some 
cı EC. Similarly, b = ed’ for some ¢2 € C and integer j. Now d'd! = 
dit} = di" = d'd’, and c, and c, commute with every element of G by the 
definition of the center. Consequently, 


ab = (aded) = aedd = coed! = (end! eid’) = ba. 


Therefore, Gis abelian. E 


E Exercises 


1. Let N be the subgroup (4) of Zw. Find the order of 13 + N in the group 
Z/N. 


2. Let G be the subgroup (3) of Z, and let N be the subgroup (15). Find the order 
of 6 + N in the group G/N. 


3. Complete the table in Example 2 and verify that every nonidentity element of 
D,/M has order 2. 


123\ f123 12 3\). 
A. 4. N= li 2 z! h 3 $! (= 1 >) } is anormal subgroup of S; by 


Example 9 of Section 8.2. Show that S3/N = 2. 
. Show that Z13/M = Z6, where M is the cyclic subgroup (6). 
. Show that Z6/N = Z,, where N is the subgroup {0, 3}. 
. Show that U,,/(5) is isomorphic to 7, 


Let G = Z, X Z, and let N be the cyclic subgroup generated by (3, 2). Show 

that G/N = Z4. 

9. Let G = Z, X Z, and let N be the cyclic subgroup ((1, 1)). Describe the 
quotient group G/N. 

10. (a) Let M be the cyclic subgroup (0, 2)) of the additive group G = Z, X Z, 
and let N be the cyclic subgroup ((1, 2)), as in Example 4. Verify that M is 
isomorphic N. 

(b) Write out the operation table of G/M, using the four cosets M + (0, 0), 
M + (1,0), M + (0, 1), M + (1, 1). 


ot AN 
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(c) Show that G/M is not isomorphic to G/N (the operation table for G/N is 
in Example 4). Thus for normal subgroups M and N, the fact that M = N 
does not imply that G/M is isomorphic to G/N. 


11. If Nis a subgroup of an abelian group G, prove that G/N is abelian. 


12. If N is a normal subgroup of a group G and if x? N for every x E€ G, prove 
that every nonidentity element of the quotient group G/N has order 2. 


13. (a) Give an example of a nonabelian group G such that G/Z(G) is abelian. 
(b) Give an example of a group G such that G/Z(G) is not abelian. 


1234\/1234\ Age (1234 
e: ep Siom PS e A E oy Gmea Oh 


is anormal subgroup of S4. 
(b) Write out the operation table for the group S4/F. 


B. In Exercises 15 and 16, find an element of infinite order and an element of finite 
order in the given quotient greup. There are many correct answers. Remember that Z 
is an additive group. 


15. (Z x Z)/((5, 5)) 

16. @ x Z)/((6, 9)) 

17. Let E be the group of even integers and N the subgroup of all multiples of 8. 
(a) Show that E/N has order 4. 
(b) To what well-known group is E/N isomorphic? [Hint: Theorem 8.8.] 

18. Show that U3/N = Uj., where N is the subgroup {1, 17}. 


19. An element 4 of a group is said to be a square if there is an element c in the 
group such that b = c?. Let N be a subgroup of an abelian group G. If both 
N and G/N have the property that every element is a square, prove that every 
element of Gis a square. 

20. If Gis a group and [G:G/Z(G)] = 4, prove that G/Z(G) = Z, X Z4. 

21. Let G be an abelian group and T its torsion subgroup (see Exercise 19 of 
Section 7.3). Prove that G/T has no nonidentity elements of finite order. 


22. Let R* be the multiplicative group of nonzero real numbers and let N be the 
subgroup {1, —1}. Prove that R*/N is isomorphic to the multiplicative group 
R*+ of positive real numbers. 


23. Describe the quotient group R*/R**, where R* and R** are as in Exercise 22. 
24. If Gis acyclic group, prove that G/N is cyclic, where N is any subgroup of G. 


14 48 
9° 5° and > = 3" the additive group Q/Z, 


(b) Prove that every element of ae has finite order. 
(c) Prove that Q/Z contains elements of every possible finite order. 


25. (a) Find the order of = 
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26. Prove that the set of elements of finite order in the group R/Z is the subgroup 
Q/Z. 


27. Let G and H be groups and let G* be the subset of G X H consisting of all (a, e) 
with a E G. 


(a) Show that G* is isomorphic to G. 
(b) Show that G* is a normal subgroup of G X H. 
(c) Show that (G X H)/G* = H. 
28. Let M and N be normal subgroups of a group G such that M N N = (e). 
Prove that G is isomorphic to a subgroup of G/M X G/N. 


29. If N is a normal subgroup of a group G and if every element of N and of G/N 
has finite order, prove that every element of G has finite order. 


30. If N is a finite normal subgroup of a group G and if G/N contains an element 
of order n, prove that G contains an element of order n. 


31. Let G be a group of order pq, with p and q (not necessarily distinct) primes. 
Prove that the center Z(G) is either (e) or G. 


32. A group H is said to be finitely generated if there is a finite subset S of H such 
that H = (S) (see Theorem 7.18). If N is a normal subgroup of a group G 
such that the groups N and G/N are finitely generated, prove that G is finitely 
generated. 


33. Let G be a group and let 5 be the set of all elements of the form aba~15~! with 
a, b € G. The subgroup G’ generated by the set S (as in Theorem 7.18) is called 
the commutator subgroup of G. Prove 


(a) G’ is normal in G. [Hint: For any g, a, b € G, show that g "aba bg = 
(g ‘agg ‘bg (ga 'g)(g~'b"g) is in S.] 
(b) G/G’ is abelian. 
34. Let G be the additive group R X R. 
(a) Show that N = {(x, y)|y = —x} is a subgroup of G. 
(b) Describe the quotient group G/N. 
35. Let N be a normal subgroup of a group G and let G’ be the commutator 
subgroup defined in Exercise 33. If N N G' = (g), prove that 
(a) NS Z(G) (b) The center of G/N is Z(G)/N. 


36. If Gis a group, prove that G/Z(G) is isomorphic to the group Inn G of all 
inner automorphisms of G (see Exercise 37 in Section 7.4). 


C.37. Let A, B, N be normal subgroups of a group G such that NG A, NS B. If 


G = AB and AN B = N, prove that G/N = A/N X B/N. (The special case 
N = (e) is Exercise 30 in Section 8.2.) 
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84 Quotient Groups and Homomorphisms 


There is a close connection between normal subgroups, quotient groups, and homo- 
morphisms.* The following definition is crucial for developing this connection. 


Definition Let f:G — H be a homomorphism of groups. Then the kernel of f is the set 


fac & jra) = ey}. 


Thus, the kernel is the set of elements in G that are mapped onto the identity element 
in H by the homomorphism f. 


EXAMPLE 1 


Let R* be the multiplicative group of nonzero real numbers and R** the 
multiplicative group of positive real numbers. The function fR* > R** given 
by f(x) = x? is a homomorphism because f(ab) = (ab)? = ab? = f(a)f(b). Its 
kernel is the set of real numbers x such that x? = 1, namely, {1, —1}. 


EXAMPLE 2 


Verify that the function f/:R* x R* — R* given by f(a, b) = b isa homomor- 
phism of multiplicative groups. Its kernel is the set of all pairs (a, b) such that 
b = 1, that is, {(a, 1) |a E R*}. 


EXAMPLE 3 


In Example 13 of Section 7.4, we saw that the function f:Z — Z; given by 
f(a) = [a] is a homomorphism of additive groups. Its kernel is the set 


K ={aeZ | f(a) = [0}} = {ae Z |[a] = [0}}. 
But [a] = [0] if and only if a = 0 (mod 5) by Theorem 2.3, and a = 0 (mod 5) if 


and only if 5|a by the definition of congruence. Hence, K is the set of all integer 
multiples of 5, that is, the cyclic group (5). 


You can easily verify that each of the kernels in Examples 1-3 is actually a (normal) 
subgroup. The same thing is true in the general case. 


*If you have read Chapter 6, this should not come as a surprise. The first part of this section simply 
carries over to groups the facts about ideals, quotient rings, and ring homomorphisms that were 
developed at the end of Section 6.2. (pages 154-158). 
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Theorem 8.16 


Let f;G —- H be a homomorphism of groups with kernel K. Then K is a normal 
subgroup of G. 


Proof>ir c, dE K, then f(c) = ey and f(d) = ey by the definition of kernel. 
Hence, f(cd) = f(c) f(d) = eyey = ey, so that cde K. If ce K, then by 
Theorem 7.20 f(c!) = f(c)~! = (ep)! = ey. Thus 0'E K. Therefore, K 
is a subgroup of G by Theorem 7.11. To show that Kis normal, we must 
verify that for any a E G and ¢ E K, a ~'ca€ K (Theorem 8.11). However, 


faac) = (NOA) = fay enfla) = flay f(a) = ew 


Therefore, a~'cae Kand Kis normal. m 
EXAMPLE 4* 


Define f:S, — Z3 as follows: f(a) = 0 if g is even and f(¢) = 1 if g is odd. 
Then f is a homomorphism (Exercise 7). Clearly, the kernel of f‘consists of all 
even permutations, that is, the kernel is A,. By Theorem 8.16, A, is a normal 
subgroup of Sẹ- 


The kernel of a homomorphism /measures how far f is from being injective. 


Theorem 8.17 


Let £G — H be a homomorphism of groups with kernel K. Then 
= (ec) if and only if f is injective. 


Proof > suppose K = (eq). If f(a) = fb), then 


flab) = faye“) [f is a homomorphism] 
= f(af (by! [Part (2) of Theorem 7.20] 


= f@fa)' = ey [f(a) = f(b) by hypothesis. 


Thus, ad~ is in the kernel, so that ab“! = eg and hence, a = b. Therefore, 
fis injective. 

Conversely, suppose f is injective. If c is any element in the kernel K, 
then f(c) = ey. By part (1) of Theorem 7.20, f(eg) = eg. Hence, f(c) = 
f(eo), which implies that c = eg since f is injective. Therefore, eg is the 
only element of K, s0 K= (eg). M 


*Skip this example if you haven't read Section 7.5. 


tThe proofs of Theorems 8.17-8.20 are simply translations from rings to groups of tha proofs of 
Theorems 6.11-6.13. 
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Theorem 8.16 states that every kernel is a normal subgroup. Conversely, every 
normal subgroup is a kernel: 


Theorem 8.18 


If N is a normal subgroup of a group G, then the map m:G — G/N given by 
(a) = Na is a surjective homomorphism with kernel N. 


Proof The map 7 is surjective because given any coset Na in G/N, we have 
nla) = Na. The definition of the group operation in G/N shows that 77 is 
a homomorphism: 


at(ab) = Nab = NaNb = q(ayn(b). 
The identity element of G/N is Ne. So the kernel of m is 
{aeEG | (a) = Ne} = {ac G| Na = Ne} [Definition of m] 
= {aE G |a = e (mod N)} [Theorem 8.2] 
= {ae G|ae EN} [Definition of congruence] 
= {şaceG|aeN =N jae! = ae =a] E 


In order to prove the First Isomorphism Theorem below, we need this lemma. 


Lemma 8.19 


Let f:G — H be a group homomorphism with kernel K. Let a, be G. Then 
f(a) = f(b) if and only if Ka = Kb. 


Proof > 1f f(a) = f(b), then f(a) f(b)! =ey. By Theorem 7.20, 
flab) = FSE) = f(@f()"' = ep 
Hence, ab“! K and a = b (mod K). So Ka = Kb by Theorem 8.2. 


Conversely, suppose Ka = Kb. By Theorem 8.2, a = b (mod K), which 
means that ab € K. Hence, f(ab™') = ey, and by Theorem 7.20, 


SAS" = F@FO") = flab) = ep- 


Multiplying both ends on the right by f(b) shows that f(a) = f(b). W 
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Theorem 8.20 First Isomorphism Theorem 


Let f:G + H be a surjective homomorphism of groups with kernel K. Then the 
quotient group G/K is isomorphic to H. 


Proof» We would like to define ¢:G/K > H by (Ka) = f(a). However, a coset 
can be labeled by many different elements. We need to know that the 
value of g depends only on the coset, and not on the particular repre- 
sentative element chosen to name it. So suppose that Ka = Kb. Then 
f(a) = f(b) by Lemma 8.19, which means that (Ka) = (Kb). Therefore, 
the map 9:G/K — H given by (Ka) = f(a) is a well-defined function, 
independent of how cosets are written. 

To prove that ọ is surjective, suppose A € H. Then # = f(c) for some 
c E G because f is surjective. Thus, ¢(Ke) = f(e) = h, and ¢ is surjective. 
To prove that ¢ is injective, suppose ¢(Ka) = ¢( Kb). Then f(a) = f(b), 
so that Ka = Kb by Lemma 8.19. Hence, is injective. Finally, g is a 
homomorphism because fis 


¢(KaKb) = g(Kab) = f(ab) = fla)f(b) = (Ka) (Kb). 
Therefore, p:G/K — His an isomorphism. E 


The First Isomorphism Theorem makes it easier to identify certain quotient groups. 


EXAMPLE 5 


Let G and H be groups and define iG X H > G by f(a, b) = a. Then fis a 
surjective homomorphism by Exercise 9 of Section 7.4. The kernel of fis 


H = {(a, b) | f(a, b) = ea} = {(a, b) | a = eg} = (ea, b) | ae H}. 


By the First Isomorphism Theorem, (G X H)/H = G, and it is easy to show 
that H is isomorphic to H (Exercise 15). 


EXAMPLE 6&6 


The function f:C* + R** given by f(a + bi) = a? + # is a surjective homo- 
morphism of multiplicative groups (Exercise 16). Since 1 is the identity in R**, 
the kernel of fis N = {a + bi|a’ + b? = 1}. Then ~N is a normal subgroup by 
Theorem 8.16 and C*/N = R** by the First Isomorphism Theorem. 


EXAMPLE 7 


As we saw in Example 1, the function f:R* + R** given by f(x) = xis a 
homomorphism with kernel K = {1, —1}. Note that fis surjective because 
for any positive real number c, f (Vc) = (Ve)? = c. By the First Isomorphism 
Theorem, R*/K = R**. 
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Subgroups of Quotient Groups 


Let N be a normal subgroup of a group G. We now investigate the subgroups of the 
quotient group G/N. 


Theorem 8.21 


Let N be a normal subgroup of a group G and let K be any subgroup of G that 
contains N. Then K/N is a subgroup of G/N. 


Proof > N is obviously a subgroup of K. By normality, Na = aN for every a E G. 
In particular, Na = aN for every aE K. Hence, N is a normal subgroup 
of K and K/N isa group by Theorem 8.13. The elements of K/N are the 
cosets Na with a € K. Since, every such coset is an element of G/N, we 
conclude that K/N is a subgroup of G/N. W 


When K is a normal subgroup of G, we get a stronger result. 


Theorem 8.22 Third lsomorphism Theorem* 


Let K and N be normal subgroups of a group G with NC K&G.Then K/Nisa 
normal subgroup of G/N, and the quotient group (G/N)/(K/N) is isomorphic 
to G/K. 


Proof > The basic idea of the proof is to define a surjective homomorphism 
from G/N to G/K whose kernel is K/N. Then the conclusion of the 
theorem will follow immediately from the First Isomorphism Theorem. 
First note that, if Na = Ne in G/N, then ac! € N by Theorem 8.2 
and the definition of congruence modulo N. Since NS K, this means 
that. ac~' E K. Consequently, Ka = Kc in G/K by Theorem 8.2 again. 
Therefore, the map f :G/N — G/K given by /(Na) = Kais a well-defined 
function, that is, independent of the coset representatives in G/N. 
Clearly f is surjective since any Kain G/K is the image of Na in G/N. 
The definition of coset operation shows that 


{(NaNb) = f(Nab) = Kab = KaKb = f (Naf(Nb). 


Hence, f is a homomorphism. Since the identity element of G/K is Ke, 

a coset Na is in the kernel of fif and only if f(Na) = Ke, that is, if and 
only if Ka = Ke. However, Ka = Ke if and only if a € K by Theorem 8.2. 
Thus the kernel of f consists of all cosets Na with a E K; in other words, 
K/N is the kernel of f. Therefore, K/N is a normal subgroup of G/N 
(Theorem 8.16), and by the First Isomorphism Theorem, (G/N)/(K/N) = 
(G/N)/kernel f= G/K. m 


*Yes, Virginia, there is a Second Isomorphism Theorem; see Exercise 40. For more about Virginia, go 
to www.stormfax.com/bios. htm 


Coppi 2012 Canga Lamaie, AN Cig Gamera May sot be copied, wasna. or Angticmd, in whois ar in part. Dus to elecranic cights, mme third panty contest may be mppmmal fren the eBook astor eChapter(s). Ediawial eeview hes 
that noy capgressed matsat dows oot exatecially affect tha oveali Laning experience Crmgege Learning manrves ths right to recevve ef@donal cocinst at any time if subsspem ripths convictions faqeire it. 


268 Chapter 8 Normal Subgroups and Quotient Groups 


Corollary 8.23 


Let N be a normal subgroup of a group G and let K be any subgroup of G that 
contains N. Then K is normal in G if and only if K/N is normal in G/N. 


Proof» tf X is normal in G, then K/N is normal in G/N by Theorem 8.22. 
Conversely, suppose that K/N is normal in G/N. Let a be any element of 
Gand k any element of K. We first prove that a™’ka E K. Since K/N 
is normal, 


Na™'ka = (Na™)(Nk)(Na) = (Na)“(\Nk)(Na) € K/N. 


Hence, Na`!ka = Nt for some tE K, so that a~'ka = nt for some nE N. 
Since N & K, we have a~!ka = nt E K, as desired. Since a and k were 
arbitrary, this proves that a~'Ka S K. Therefore, K is normal in G by 
Theorem 8.11. E 


We now have complete information about subgroups of G/N that arise from 
subgroups of G that contain N. Are these the only subgroups of G/N? The next 
theorem answers this question in the affirmative. 


Theorem 8.24 


lf 7 is any subgroup of G/N, then 7 = H/N, where H is a subgroup of G that 
contains N. 


Proof» Let H = {ae G|NaeT}. Exercise 23 shows that H is a subgroup of G. 
If aEN, then ae! = ae = a EN, so a = e (mod N). By Theorem 8.2, 
Na = NeeT. Hence, a € H. Therefore, N& H. Finally, the quotient 
group H/N consists of all cosets Na with a € H, that is, all Nac T. Thus, 
H/N=T. @ 


Simple Groups 

In Section 8.1 we considered the classification problem for finite groups—the attempt 
to produce a list of groups such that every finite group is isomorphic to exactly one 
group on the list. We now introduce the groups that apparently are the key to solving 
the classification problem. Recall that a group G always has two normal subgroups, 
the trivial group (e) and G itself (Exercise 4 in Section 8.2). A group G is said to be 
simple if its only normal subgroups are (e) and G. 


EXAMPLE 8 


If pis prime, then any (normal) subgroup H of the additive group Z, must have 
order dividing p by Lagrange’s Theorem. So H must have order 1 or p, so that 
H = (0) or H = @,. Therefore, Z, is simple. 
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Theorem 8,25 


G is a simple abelian group if and only if G is isomorphic to the additive group 
Z, for some prime p. 


Proof» The preceding example shows that any group isomorphic to Z, is 
simple. Conversely, suppose Gis simple. Since every subgroup of an 
abelian group is normal, G has no subgroups at all, except (e) and G. 
So if a is any nonidentity element of G, then the cyclic subgroup (a) 
must be G itself. Since every infinite cyclic group is isomorphic to Z by 
Theorem 7.19 and Z has many proper subgroups, G = (a) must bea 
cyclic group of finite order n. We claim that » is prime. If n were com- 
posite, say n = td with 1 < d < n, then (a) would be a subgroup of G 
of order d by part (3) of Theorem 7.9, which is impossible since G is 
simple. Therefore, G is cyclic of prime order and, hence, is isomorphic 
to some Z, by Theorem 7.19. E 


Nonabelian simple groups are relatively rare. There are only five of order less than 
1000 and only 56 of order less than 1,000,000. A large class of nonabelian simple 
groups, the alternating groups, is considered in Section 8.5. 

We now show why simple groups are the basic building blocks for all groups. If G 
is a finite group, then it has only finitely many normal subgroups other than itself (and 
there is at least one such subgroup since (e) is normal). Let G, be a normal subgroup 
(other than G) that has the largest possible order. We claim that G/G, is simple. If 
G/G, had a proper normal subgroup, then by Theorem 8.24 and Corollary 8.23 this 
subgroup would be of the form M/G,, where M is a normal subgroup of G such that 
G, G M G G. In this case, M would be a normal subgroup other than G with order 
larger than |G\|, a contradiction. Hence, G/G, is simple. 

If G, # (e), let G, be a normal subgroup of G; (other than G,) of largest possible 
order. (Gz is normal in G,, but need not be normal in G.) The argument in the preced- 
ing paragraph, with G, in place of Gand G, in place of G,, shows that G,/G, is simple. 
Similarly, if G, + (e), there is anormal subgroup G; of G, such that G, # G, and G,/G, 
is simple. This process can be continued until we reach some G, that is the identity 
subgroup (and this must occur since the order of G, gets smaller at each stage). Then 
we have a sequence of groups 


G=G26,26242°°:2 G,-1 2 G, = (e) 


such that each G, is a normal subgroup of its predecessor and each quotient group 
G,/G;4, is simple. The simple groups Go/G;, G1/ Ga, . > > , Ga-1/ Gn are called the 
composition factors of G. 

It can be shown that the composition factors of a finite group G are independent 
of the choice of the subgroups G,. In other words, if you made different choices of 
the G,, the simple quotient groups you would obtain would be isomorphic to the ones 
obtained in the previous paragraph. This means that the composition factors of G are 
completely determined by the structure of G and suggests a strategy for solving the 
classification problem. If we could first classify all simple groups and then show how 
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the composition factors of an arbitrary group determine the structure of the group, it 
would be possible to classify all groups. 

The good news is that the first half of this plan has already succeeded. For more 
than four decades, a number of group theorists around the world worked on various 
aspects of the problem and eventually obtained a list of simple groups such that every 
finite simple group is isomorphic to exactly one group on the list.* The complete proof 
of this spectacular result runs some 10,000 pages! For a brief history of the search for 
simple groups, see Gallian [23] or Steen [25]. 


E Exercises 


NOTE: The congruence class of a in Z, is denoted [a], whenever necessary to avoid 
confusion. 


A. In Exercises 1-9, verify that the given function is a homomorphism and find its 
kernel. 


1. iC > R, where f(a + bi = b. 
2. g:R* > Z3, where g(x) = 0 if x > 0 and g(x) = 1 if x < 0. 
3. h: R*  R*, where h(x) = xX. 
4. fi Q* > Q**, where f(x) = | x} 
5. 9:Q X Z >Z, where f(x, y)) = Y- 
6. h:C > C, where h(x) = x*. 
71 f:S,—> Za, where f(a) = 0 if g is even and f(a) = 1 if g is odd. 
8. f: Zi2 —> Zin where f(x) = 3x. 
9. fZ— Z, X Z4, where f(a) = (lal, [ala)- 
10. @:S, > S,,.1, where for each f E Sm (f) €S,41 is given by 


ifflsksr 


PUNE Cr ifk=n+1 


n+l 


11. Suppose that k, n, and r are positive integers such that k |n. Show that the 
function fZ, — Z; given by f([a],,) = [ra]; is well defined (meaning that if 
[a], = [4],; then [ra], = [r5],). 


“The proof was first announced in 1981, but a few years later a gap in the proof was discovered. It 
took until 2004 for this gap to be fixed. 


Skip this exercise if you haven't read Section 7.5. 
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In Exercises 12-14, verify that the given function is a surjective homomorphism of 
additive groups. Then find its kernel and identify the cyclic group to which the kernel 
is isomorphic. [Exercise 11 may be helpful. ] 


12. A:Z,.—> Z6, where (faj) = [a]g. 

13. A:Z1¢ —> £4, where h([a}is) = [3a]. 

14. h:Zig > Za, where h([x];,) = [2x]. 

15. If Hand H are the groups in Example 5. Show that H = H. 


16. Prove that the function f: C* — R** given by f(a + bi) = a + bis a surjective 
homomorphism of groups. 


17. (a) Produce a list of groups such that every homomorphic image of Z4, is 
isomorphic to exactly one group on the list. [Hint: See Exercise 26 in 
Section 7.4.] 


(b) Do the same for Z,- 

18. Find all homomorphic images of D4. 

19. Find all homomorphic images of 53. 

20. (a) List all subgroups of Z,,/H, where H = {0, 6}. 
(b) List all subgroups of 2)/K, where K = {0, 4, 8, 12, 16}. 

21. Suppose that Gis a simple group and f:G —> H is a surjective homomorphism 
of groups. Prove that either fis an isomorphism or H = {e}. 

B.22. Let G bean abelian group. 

(a) Show that K = {a€ G |ja| = 2} is a subgroup of G. 
(b) Show that H = {x*]x € G} isa subgroup of G. 


(c) Prove that G/K = H. [Hint: Define a surjective homomorphism from G to 
H with kernel K.] 


23. If Nis anormal subgroup of a group G and T is a subgroup of G/N, show 
that H = {a€G| Na E T} is a subgroup of G. 

24. If k|n and fU, —> U; is given by f([x],) = [x],, show that fis a homomorphism 
and find its kernel. 

25. Prove that (Z X Z)/{(1, 1)) = Z. [Hint: Show that fZ x Z — Z, given by 
Sa, b)) = a — b, is a surjective homomorphism.] 

26. Prove that Z X 2)/{(2, 2)) = Z X Z,. [Hint: Show that #:Z X Z >Z X Z, 
given by A((a, b)) = (a — b, [b],) is a surjective homomorphism.] 

27. Let M be a normal subgroup of a group Gand let N be a normal subgroup 
of a group H. Use the First Isomorphism Theorem to prove that M X Nisa 
normal subgroup of G X H and that (G X H)/(M X N) = G/M X H/N. 

28. SL{2, R) is a normal subgroup of GL(2, R) by Exercise 25 of Section 8.2. 
Prove that GL(2, R)/SL(2, R) is isomorphic to the multiplicative group R* of 
nonzero real numbers. 


29. If k|n, prove that Z,/(k) = Z,. [Exercise 11 may be helpful.] 
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30. If fG — His a homomorphism of finite groups, prove that |Im f| divides] G| 
and |H}. [Im f was defined just before Theorem 7.20.] 


31. Prove that Z,, = Z, X Z4. [Consider fZ > Z, X Z4 given by f(a) = ({a]3, [a]4)-] 
32. Let M be a normal subgroup of a group Gand let N be a normal subgroup of 


a group H. If f:G—+ H is a homomorphism such that f(M) & N, prove that the 
map g:G/M-—> H/N given by g(Ma) = Nf(a) is a well-defined homomorphism. 


33. Let fG — H be a surjective homomorphism of groups with kernel K. Prove 
that there is a bijection between the set of all subgroups of H and the set of 
subgroups of G that contain K. 


34. (An exercise for those who know how to multiply 3 X 3 matrices.) Let G be 
the set of all matrices of the form 


oo} 
ore à 
=a Oo & 


where a, b, c EQ. 
(a) Show that G is a group under matrix multiplication. 


(b) Find the center C of G and show that C is isomorphic to the additive 
group Q. 


(c) Show that G/C is isomorphic to the additive group Q x Q. 


35. Let G and H be the groups in Exercises 33 and 34 of Section 7.1. Use the 
First Isomorphism Theorem to prove that H is normal in G and that G/H is 
isomorphic to the multiplicative group R* of nonzero real numbers. 

[Hint: Consider the map f:G —> R* given by f(T) = a.] 


36. Let N be a normal subgroup of a group G and let f:G — H be a 
homomorphism of groups such that the restriction of f to N is an 
isomorphism N = H. Prove that G = N X K, where K is the kernel of f. 
[Hint: Exercise 30 in Section 8.2.] 


37. Prove that Q* = Q** X Z, [Hint: Exercises 4 and 36.] 


38. Let N be a normal subgroup of a group G. Prove that G/N is simple if and 
only if there is no normal subgroup K such that N G K G G. 
[Hint: Corollary 8.23 and Theorem 8.24.] 


39.* The additive group Z[x] contains Z (the set of constant polynomials) as a 
normal subgroup. Show that Z[x]/Z is isomorphic to Z[x]. This example 
shows that G/N = G does not necessarily imply that N = (e). [Hint: Consider 
the map T:Z[x] > Z[x]/Z given by 7( f(x) = Z + xf(x).] 

C.40. (Second Isomorphism Theorem) Let K and N be subgroups of a group G, with 
N normal in G. Then NK = {nk |n EN, k E K} is a subgroup of G that contains 
both K and N by Exercise 20 of Section 8.2. 


(a) Prove that N is anormal subgroup of NK. 


*Skip this exercise if you have not read the first part of Section 4.1. 
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(b) Prove that the function f:K + NK/N given by f(k) = Nk is a surjective 
homomorphism with kernel KN N. 


(c) Conclude that K/(N A K) = NK/N. 
41. Cayley’s Theorem 7.21 represents a group Gas a subgroup of the permutation 
group A(G). A more efficient way of representing G as a permutation group 


arises from the following generalized Cayley’s Theorem. Let K be a subgroup 
of Gand let T be the set of all distinct right cosets of K. 


(a) If a € G, show that the map f} T — T given by f (Kb) = Kbaisa 
permutation of the set T. 


(b) Prove that the function ¢:G— A(T) given by g(a) = fa, isa 
homomorphism of groups whose kernel is contained in K. 
(c) If K is normal in G, prove that K = kernel g. 
(d) Prove Cayley’s Theorem by applying parts (b) and (c) with K = (e). 
42. A group Gis said to be metabelian if it has a subgroup N such that Nis 
abelian, N is normal in G, and G/N is abelian. 
(a) Show that S, is metabelian. 
(b) Prove that every homomorphic image of a metabelian group is metabelian. 
(c) Prove that every subgroup of a metabelian group is metabelian. 


APPLICATION: Decoding Techniques (Section 16.2) may be covered at 
this point if desired. 


85 The Simplicity of A,,* 


As we saw at the end of Section 8.4, simple groups appear to be the key to solving the 
classification problem for finite groups. This fact and the following theorem are one 
reason that the alternating groups 4, are important. 


Theorem 8.26 


For each n # 4, the alternating group A, is a simple group. 


The group Ag is not simple (Exercise 7). Although the entire proof of Theorem 8.26 
is rather long, it requires only basic facts about the symmetric groups and normal 
subgroups. There will be many instances in the proof where we will deal with permuta- 
tions such as (abcd) or (a2) or (ab)(cd). In all such cases, 

distinct letters represent distinct elements of {1,2,..., 2}. 


The proof of the theorem requires two lemmas. 


*Section 7.5 is a prerequisite. This section is not used in the sequel and may be omitted if desired. 
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Lemma 8.27 


Every element of A, (with n = 3) is a product of 3-cycles. 


Proof» Every element of A, is by definition the product of pairs of transposi- 
tions. But every such pair must be of one of these forms: (ab) (ed) or 
(ab) (ac) or (ab) (ab). In the first case verify that (ab) (ed) = (adb) (ade), 
in the second that (ab) (ac) = (acb), and in the last that (ab) (ab) = (1) = 
(abc) (acb). Thus every pair of transpositions is either a 3-cycle or a 
product of two 3-cycles. Hence, every product of pairs of transpositions 
is a product of 3-cycles. E 


Lemma 8,28 


If N is a normal subgroup of A, (with n = 3) and N contains a 3-cycle, then 
N = An 


Proof» For notational convenience, assume that (123) € N [the argument when 
(rst)E N is the same; just replace 1, 2, 3 by r, s, t, respectively]. Since 
(123) EN, we see that (123)(123) = (132) is also in N. For k = 4, let 
x = (12)(3k) and verify that x~! = (3k)(12). The normality of N implies 
that x(132)x7! € N by Theorem 8.11. But 


x(132)x7! = (12)(3K)(132)(3k)(12) = (12k). 
Therefore, 
(*) N contains all 3-cycles of the form (12k) with k = 3. 
Verify that every other 3-cycle can be written in one of these forms: 
(la2), (lab),  (2ab), (abe) 
where a, b, c = 3. By (+) and closure in N, 
(la2) = (12a)(12a) E N; 
(lab) = (12b)(124a)(12a) E€ N; 
(2ab) = (12b)(12b)(12a) E N; 
(abe) = (12a)(12a)(12c)(12)(125)(12a) E N. 


Thus N contains all 3-cycles, and, hence, N contains all products of 
3-cycles by closure. Therefore, N = A, by Lemma 8.27. E 


We are now ready to prove Theorem 8.26. The following fact will be used frequently: 
(++) The inverse of the cycle (a,a,a@, + + + a,) is the cycle (a,a,a, ; * * * a343). 


For example, (12345)! = (15432) and (678)! = (687), as you can easily verify. 
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Proof of Theorem 8.26 » A, and A; are simple abelian groups (Exercise 2). So 

assume n = 5. We must prove that A, has no proper normal subgroups. 
Let N be any normal subgroup of A,, with N # (1). We need only 
show that N = A,. When all the nonidentity elements of N are written 
as products of disjoint cycles, then there are three possibilities for the 
lengths of these cycles: 

1. Some cycle has length = 4. 

2. Every cycle has length <£ 3, and some have length 3. 


3. Every cycle has length =£ 2. 


We shall show that in each of these cases, N = A,. 


Case 1 N contains an element g that is the product of disjoint cycles, at least 
one of which has length r = 4. For notational convenience we assume that 
g = (1234 - - + r)T, where 7 is a product of disjoint cycles, none of which 
involve the symbols 1, 2,3, 4, . . . , r.t Let 6 = (123) A,. Since N is a 
normal subgroup and g E N, we have g~"(5g'5~!) € N by Theorem 8.11. 
An easy computation shows that 
ao '(Sa8~!) = [(1234 -  « ry}! (123)[(1234 - > + r)rY(123)7! 
= 7"(1234 + + = )1(123)[(1234 - - + )T\(123)™ [Corollary 7.6] 
= 7 "(Ir + + + 432)(123)(1234 - + + r)r(132) [Statement (+*)] 
= 7 'r(Ir+ + + 432)(123)(1234 + + + r)(132) [Theorem 7.23] 


= ()(13r) = (13r). 
Therefore, (13r)€ N,and hence, N = A, by Lemma 8.28. 


Case 2A N contains an element ø that is the product of disjoint cycles, at least 
two of which have length 3. For convenience we assume that ¢ = 
(123)(456) T, where T is a product of disjoint cycles, none of which in- 
volve the symbols 1, 2,..., 6. Let 5 = (124) €A,. Then, as in Case 1, 
N contains ¢~'(8g5~'), and we have a similar calculation: 


a~ (êr) = [(123)(456)r]1(124)(123)(456)r(124)! 
= 77'(456)'(123)-'(124)(123)(456)7(124)"! [Corollary 7.6] 


= 7~'(465)(132)(124)(123)(456)r (142) [Statement (#*)] 
= 77 '7(465)(132)(124)(123)(456)(142) [Theorem 7.23] 
= (14263). 


Therefore, (14263) € N, and N = A, by Case 1. 


tThe same argument works with an arbitrary r-cycle (abcd -- - t) in place of (1234 - -- r); just replace 
1 by a, 2 by b, etc. Analogous remarks apply in the other cases, where specific cycles will also be 
used to make the argument easier to follow. 
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Case 2B N contains an element ø that is the product of one 3-cycle and some 
2-cycles. We assume that g = (123)r, where q is a product of disjoint 
transpositions, none of which involve the symbols 1, 2, 3. Sincea 
product of disjoint transpositions is its own inverse (Exercise 5), 
Theorem 7.23 shows that 


= (123)r(123)r = (123)(123)rr = (123)(123) = (132). 
But o7€ Nsinceg EN. Therefore, (132) EN, and N = A, by Lemma 8.28. 


Case 2C N contains a 3-cycle. Then N = A, by Lemma 8.28. 


Case 3 Every element of N is the product of an even number of disjoint 
2-cycles. Then a typical element g of N has the form (12)(34)7, where 
T is a product of disjoint transpositions, none of which involve the 
symbols 1, 2, 3, 4. Let 8 = (123)€ A,. Then, as above, o'(6a8~') EN. 
Using Corollary 7.6, Theorem 7.23, and statement (+#), we see that 


a~o") = 7~*(34)(12)(123)(12)(34)7(132) = (13)(24). 


Since n = 5, there is an element k in {1, 2, . . . , n} distinct from 1, 2, 3, 4. 
Let a = (13k) E'A, Let B = (13)(24), which was just shown to be in N. 
Then by the normality of N and closure, B(aBa~') EN. But 


Bapa) = (13)(24)(13K)(13)(24)(1k3) = (13k). 


Therefore, (13k) € N, and N = A, by Lemma 8.28. E 


Theorem 8.26 leads to an interesting fact about the normal subgroups of Sẹ: 


Corollary 8.29 


If n = 5, then (1), An and S, are the only normal subgroups of Sp. 


Sketch of Proof » Suppose that N is a normal subgroup of S,,. Then N N A, is 
a normal subgroup of A, (Exercise 19 of Section 8.2). Theorem 8.26 
shows that N N A, must either be A, or (1). If N N A, = A,, then N = A, 
or S, (Exercise 10). If N N 4, = (1), then all the nonidentity elements of 
N are odd. Since the product of two odd permutations is even, that is, an 
element of A,, and N N 4, = (1), the product of any two elements of N 
is (1). Therefore, N = (1) (Exercises 8 and 9). W 
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E Exercises 


A. 1. (a) List all the 3-cycles in S4. 
(b) List all the elements of A, and express each as a product of 3-cycles. 
2. (a) Verify that 4, = (1). 

(b) Show that 4; is a cyclic group of order 3 and hencesimple by Theorem 8.25. 
. Find the center of the group 44- 
. If n = 5, what is the center of A,? 
If ø € S, is a product of disjoint transpositions, prove that g? = (1). 
. Prove that A; has no subgroup of order 30. [Hint: Exercise 23 of Section 8.2.] 


. Prove that N = {(1), (12)(34), (13)(24), (14)(23)} is a normal subgroup of 44. 
Hence, A, is not simple. [Hint: Exercise 23 of Section 7.5. For normality, use 
Exercise | (a) and straightforward computations.] 


w 
Ya wWe w 


8. Prove that no subgroup of order 2 in S, (n = 3) is normal. [Hint: Exercises 26 
of Section 7.5 and 16 of Section 8.2.] 


9. Let N be a subgroup of S, such that o7 = (1) for all nonidentity elements 
oa, TEN. Prove that N = (1) or Nis cyclic of order 2. [Hint: If N + (1), let 
g bea nonidentity element of N. Show that g has order 2. If 7 is any other 
nonidentity element of N, show that g = T.] 


10. If Nis a normal subgroup of S, and N N A, = An, prove that N = A, or Sp- 
[Hint: Why is A, © NS S,? Use Theorem 7.29 and Lagrange’s Theorem.] 


11. Prove that A, is the only subgroup of index 2 in Sẹ. [Hint: Exercise 23 of 
Section 8.2 and Corollary 8.29.] 


12. If f:S, — S, is a homomorphism, prove that {(A,)& Ap 
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CHAPTER 9 


Topics in Group Theory 


This chapter takes a deeper look at various aspects of the classification problem 
for finite groups, which was introduced in Section 8.1. After the necessary pre- 
liminaries are developed in Section 9.1, all finite abelian groups are classified up 
to isomorphism in Section 9.2. The basic tools for analyzing nonabelian groups are 
presented in Sections 9.3 and 9.4. Applications of these results and several other 
facts about the structure of finite groups are considered in Section 9.5, where 
groups of small order are classified. 

Sections 9.3 and 9.4 are independent of Sections 9.1 and 9.2 and may be read 
first if desired. Sections 9.1-9.4 are prerequisites for Section 9.5. 


} 9.1 | Direct Products 


If G and H are groups, then their Cartesian product G X H is also a group, with the 
operation defined coordinatewise (Theorem 7.4). In this section we extend this notion 
to more than two groups. Then we examine the conditions under which a group is 
(isomorphic to) a direct product of certain of its subgroups. When these subgroups are 
of a particularly simple kind, then the structure of the group can be completely deter- 
mined, as will be demonstrated in Section 9.2. Throughout the general discussion, all 
groups are written multiplicatively, but specific examples of familiar additive groups 
are written additively as usual. 

If G,, G,,..., G, are groups, we define a coordinatewise operation on the Cartesian 
product G, X Gz X -++ X Gp as follows: 

(a, Anss., a), b,, Oe) b,) = (abı, a,b, sae > 9 aby). 

It is easy to verify that G, X G, X +++ X G, is a group under this operation: If e, is the 
identity element of G, then (@;, &, . . . , e) is the identity element of G, X G X - -° X Ga 
and (a,~1, a. !, . -a,~ ")is the inverse of (a4, 2, ...,4,). This groupis called the direct 
product of G, G;,..., G,.* 


*When each G; is an additive abelian group, the direct product of G,,..., G, is sometimes called the 
direct sum and denoted G, @ G,.@ ---@G,,. 281 
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EXAMPLE 1 


Recall that U, is the multiplicative group of units in Z, and that U, = {1, 3} 
and U, = {1, 5} (see Theorem 2.10). The direct product U, X Us X Z, consists 
of the 12 triples 
(1, 1, 0), (1, 1, 1), (1, 1, 2), (1, 5, 0), (1, 5, 1), (1, 5, 2), 
(3, l, 0), (83, l 1), 3, ll, 2), (3, 5, 0), GB, 5; 1), (3, 5; 2). 
Note that U, has order 2, U, has order 2, Z, has order 3, and the direct product 
U4 X Us X Z; has order 2 « 2 + 3 = 12. Similarly, in the general case, 
if G, G,,...,G, are finite groups, then 
G, X Gx x G, has order |G * |G! ++ |G} 


In the preceding example it is important to note that the groups U4, Ug, and 
Z, are not contained in the direct product U4 X Us X Z;. For instance, 5 is an 
element of Us, but 5 is not in U, X U; X Z, because the elements of U, X Us X Z, 
are triples. In general, for 1 sisn 


G; is not a subgroup of the direct product G; x G; x +++ X G,.* 


This situation is not entirely satisfactory, but by changing our viewpoint slightly 
we can develop a notion of direct product in which the component groups may 
be considered as subgroups. 


EXAMPLE 2 


It is easy to verify that M = {0, 3} and N = {0, 2, 4} are normal subgroups of 
Z; (Do it!), Observe that every element of Z,can be written asa sumof an ele- 
ment in M and an element in N in one and only one way: 

0=0+0 1=3+4 2=0+2 

3=3+0 4=0+4 §=3+4+2. 


Venfy that, when the elements of Z,are written as sums in this way, then the 
addition table for Z, looks like this: 


0+0 3+4 0+2 3+0 0+4 3+2 
0+0 | 0+0 StA O+2 3+0 O+4 3F2 
3+4 3+4 0+2 3+0 0+4 3+2 0+0 
0+2 0+2 3+0 0+4 3+2 0+0 3+4 
3+0 3+0 0+4 3+2 0+0 3+4 0F2 
0+4 | 0+4 3+2 0+0 3+4 0+2 3+0 
3+2 3+2 0+0 3+4 0+2 3+0 0+4 


*It istrue, however, that an isomorphic copy of G; is a subgroup of G, X G; X » - - X G, (see Exercise 12). 


(Copyright 2012 Cangngp Leacuing. AN Rights Ramrvad May act be copied, ecemet, or duplicaten, in whote or ka part Dus to eiacirunic digits, mma third party contest may be mppamed firun the eBook emiVor aChapter(s). Edimwial erdew has 
damed that noy appead comme does not maianially affect the ovaali learning apeiara Omage learning msarv the right w reamve etSdoosl coomel at asy time if subsequent rights crevictinms require it. 


9.1 Direct Products 283 


Compare the Z, table with the operation table for the direct product M X N: 


(0,0) 64A @©2) (0 0) (3,2) 
(0,0) | (0,0) (3,4) (0,2) BA (0,4) (3,2) 
34 | 69 @2) G0) 04) (3,2) 0,0) 
(0,2) | ©,2) (3,0 (0,4) (G2 (0,0) (3,4 
(3,0) | (3,0) © 4) 6D) (©,0 (34) 0D 
0,4. | @4 £4(2) @©0) (3,4) M) @,0) 
(3,2) | 6D) @©,0 (4) (0,2) (3,0) (0,4) 


The only difference in these two tables is that elements are written a + bin 
the first and (a, b) in the second. Among other things, the tables show that the 
direct product M X N is isomorphic to &, under the isomorphism that assigns 
each pair (a, b)€ M X N to the sum of its coordinates a + bE Z6. 


Consequently, we can express Z, as a direct product in a purely internal fashion, 
without looking at the set M X N, which is external to Z,: Write each element uniquely 
as a sum a + b, with ae M and b EN. We now develop this same idea in the general 
case, with multiplicative notation in place of addition in Z,. 


Theorem 9.1 


Let N;,No...,N, be normal subgroups of a group G such that every element 
in G can be written uniquely in the form aja «+ + a, with a,EN,* Then G is 
isomorphic to the direct product N; X Na X +++ X Ne 


The proof depends on this useful fact: 


Lemma 9.2 


Let M and N be normal subgroups of a group G such that M N N=€).IfaeM 
and b EN, then ab = ba. 


Proof > Consider a—'b-'ab. Since M is normal, b~'ab € M by Theorem 8.11. 
Closure in M shows that a~'b—!ab = a~1(b~'ab) € M. Similarly, the 
normality of N implies that a~'b-'a E€ N and, hence, a~'b~ab = 
(a'b-'a)b E N. Thus a'b“'abe M N N = (e). Multiplying both sides 
of a~'b~'ab = e on the left by ba shows that ab = ba. m 


Proof of Theorem 9.1 » Guided by the example preceding the theorem (but using 
multiplicative notation), we define a map 


SN XN XXN aE by — fA, dy, « - «Ay = at: * 


*Uniqueness means that if a,a,... a, = 5,5). . . b, with each a, b; E N; then a; = b; for every i. 
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Since every element of G can be written in the form ajaz - - + a, (with 

a, N)) by hypothesis, fis surjective. If f(a, &,... , ap) = f(D, bas ..-, bp, 
then aja, +++ a, = b; by + + + by. By the uniqueness hypothesis, a, = b, for 
each i(1 s i sk). Therefore, 


(ar, an -s ak) = (bi, by... , by) in Ny X Np X +++ XM, 


and fis injective. 

In order to prove that fis a homomorphism we must first show that 
the N’s are mutually disjoint subgroups, that is, N, N N, = (e) when i # j. 
If ae N,N N, then a can be written as a product of elements of the N’s 
in two different ways: 


ee ee et ee ee ree a 
t ToO A + T E * f 
N N N M N N N Me 


The uniqueness hypothesis implies that the components in N; must be 
equal: a = e. Therefore, N, N N; = (e) for i + j. In showing that fis a 
homomorphism, we shall make repeated use of this fact, which together 
with Lemma 9.2, implies that a,b, = bja; for a; € N, and bE N; 


Jila - -- abi,- «+ b)] = flab -> - , arbe) 
= aib, ab, a;b; ++ > abg 


= H 3b3 + + aby 
= aja, bya; bb; + + ` apby 


= aa, Az), bab; ` - > apbg. 


Continuing in this way we successively move aj, 4s, . . . , Ag to the left 
until we obtain 


Silar- - + s Or, « «+ ba] = (a102 * * * eM(brb2* + + bi) 
= f(a, eee » afb, . -s by). 


Therefore, f is homomorphism and, hence, an isomorphism. W 


Whenever G is a group and N4, . . . , Ng are subgroups satisfying the hypotheses 
of Theorem 9.1 we shall say that G is the direct product of N, . . - , Ng and write 
G = N, X ++- XN,. Each N, is said to be a direct factor of G. Depending on the con- 
text, we can think of G as the external direct product of the N, (each element a k-tuple 
(a,,.-., a) EN; X +>- X N,) or as an internal direct product (each element written 
uniquely in the form aja, : + - a, E a, E G). 

The next theorem is often easier to use than Theorem 9.1 to prove that a group is 
the direct product of certain of its subgroups. The statement of the theorem uses the 
following notation. If Mand N are subgroups of a group G, then MN denotes the set 
of all products mn, with me M and nE N. 
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Theorem 9.3 


If Mand N are normal subgroups of a group G such that G = MNand MN N=@), 
thenG=MXN. 


For the case of more than two subgroups, see Exercise 25. 


Proof of Theorem 9.3 > By hypothesis every element of G is of the form mn, with 
me M,neEN. Suppose that an element had two such representations, say 
mn = mn, with m, m; E M and n, n EN. Then 


mn = mn 
m`'mn = m'm [Left multiply both sides by m,". 
m ‘mn = ny 

my aan" = ayn [Right multiply both sides by A~] 


But m,'me Mand nn !eNand MN N= (à. Thus m'm = e and 
m = m;; similarly, n = m. Therefore, every element of G can be written 
uniquely in the form mn (mE M, nE N), and, hence, G = M X N by 
Theorem 9.1. Em 


EXAMPLE 3 


By Theorem 2.10, the multiplicative group of units in Z45 is U15 = 

{1, 2, 4, 7, 8, 11, 13, 14}. The groups M = {1, 11} and N = {1, 2, 4, 8} are 
normal subgroups whose intersection is (1). Every element of N is in MN (for 
instance, 2 = 1 - 2), and similarly for M. Since 11-2 = 7, 11-8 = 13, and 
11-4 = 14, we see that Uj; = MN. Therefore, Uj; = M X N by Theorem 9.3. 
Since N is cyclic of order 2 and M cyclic of order 4 (2 is a generator), we con- 
clude that U,; is isomorphic to Z, X Z, (see Exercise 10 and Theorem 7.19). 


@ Exercises 


NOTE: Unless stated otherwise, G,,..., G, are groups. 
A. 1. Find the order of each element in the given group: 
a) Z xZ, ®24,X2,x2, (© DX 
2. What is the order of the group Us X U, X UX U,? 
3. (a) List all subgroups of 7, X Z2. (There are more than two.) 
(b) Do the same for Z, X Z, X Z2. 
4. If Gand H are groups, prove that G X H = H X G. 
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S 


6. 


11. 


12. 


13. 


14. 


15. 


Give an example to show that the direct product of cyclic groups need not be 
cyclic. 

(a) Write Z,, as a direct sum of two of its subgroups. 

(b) Do the same for 2,5. 


(c) Write Z, in three different ways as a direct sum of two or more of its 
subgroups. [Hint: Theorem 9.3.] 


. Let G,..., Ga be groups. Prove that G; X --- X G,,is abelian if and only if 


every G, is abelian. 


. Let i be an integer with 1 = i = n. Prove that the function 


TG, X Gy X -X Ga > G 


given by 7{a), a, a3, . . . , @,) = qis a surjective homomorphism of groups. 


. Is Zs isomorphic to Z4 X.Z,? 


. (a) If f:G, — H; and g:G, — H, are isomorphisms of groups, prove that 


the map @:G, X G,— H; X Hp given by 6(a, b) = (f(a), g(b)) is an 
isomorphism. 


(b) If G= A; fori = 1,2,...,, prove that 
GX +++ G= Hy X +++ X Ay 
Let H, K, M, N be groups such that K = M X N. Prove that H X K = 
HXMXN. 
Let į be an integer with 1 = i = n. Let G, be the subset of G, X --: X G, 
consisting of those elements whose ith coordinate is any element of G, and 
whose other coordinates are each the identity element, that is, 
G, a {(e1, soos Opts By Cpr ays ses en) la EG}. 
Prove that 
(a) G, is a normal subgroup of G, X --- X Gp. 
b) G= G. 
(c) G, X ++ X Gn is the (internal) direct product of its subgroups Giss 
Gp. [Hint: Show that every element of G, X + + + X G, can be written 
uniquely in the form aja, * + an, with a; E G; apply Theorem 9.1.] 


Let G be a group and let D = {(a, a, a) |a E G}. 
(a) Prove that D is a subgroup of G X G X G. 
(b) Prove that D is normal in G X G X Gif and only if G is abelian. 


If G., . . . , G, are finite groups, prove that the order of (aq, a, . . . , a) in 
G, X +++ X G, is the least common multiple of the orders |a;|, jaz}, . . . , |@y|- 


Let i, in ... , i be a permutation of the integers 1, 2, . . . , n. Prove that 


G, X G, X m2 2 XG, 
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is isomorphic to 


G, X GX X Gy 


[Exercise 4 is the case n = 2.] 

16. If N, Kare subgroups of a group G such that G = N X K and M is a normal 
subgroup of N, prove that M is a normal subgroup of G. [Compare this with 
Exercise 14 in Section 8.2.] 

17. Let @* be the multiplicative group of nonzero rational numbers, Q** the 


subgroup of positive rationals, and H the subgroup {1, —1}. Prove that 
Q* = Q** x H. 

18. Prove that Uj, is isomorphic to Z, X Z, [Hint: Theorem 9.3.] 

19. Let G be a group and fi:G > G, Jj: > Ga, .. . , JG > G, homomorphisms. 
Fori=1,2,...,2, let 7, be the homomorphism of Exercise 8. Let 


f*:G —G, X +++ X G, be the map defined by f*(@) = (fila), Jad), -- - s Jan). 
(a) Prove that f* isa homomorphism such that 7,°f'* = f; for each i. 


(b) Prove that f* is the unique homomorphism from G to G, X <- X Ga such 
that 7,°f* = f for every i. 

20. Let N; ,...;, N; be subgroups of an abelian group G. Assume that every 
element of G can be written in the form a, - + - a„ (with aE N,) and that 
whenever aja) -* + a, = e, then q = e for every i. Prove that 
G=N,XN,X+*+X Np 

21. Let Gbe an additive abelian group with subgroups H and K. Prove that 
G = H X Kif and only if there are homomorphisms 


Tı T2 
HES GESK 
5, 5, 


such that 5,(77,(x)) + 6,(72(x)) = x for every x E Gand m7, ° 5, = ty, 72° Ô = tx, 
Tı ° 6 = 0, and 72° 5, = 0, where vy is the identity map on XY, and 0 is the map 
that sends every element onto the zero (identity) element. [Hint: Let 77; be as in 
Exercise 8.] 

22. Let Gand H be finite cyclic groups. Prove that G X H is cyclic if and only if 
(GI, Z|) =1. 

23. (a) Show by example that Lemma 9.2 may be false if N is not normal. 
(b) Do the same for Theorem 9.3. 

24. Let N, K be subgroups of a group G, with N normal in G. If Nand K are 
abelian groups and G = NK, is G the direct product of N and K? 


25. Let Nj, ..., N, be normal subgroups of a group G. Let N,N, - - - N, denote 
the set of all elements of the form aja, + > + a, with a,e¢ N, Assume that 
G =N,N,: n - N, and that 


NIA (Nico NiNa t Nd = @ 


for each i (1 = i = n). Prove that G = N, X Ny X +*+ X Ny. 
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26. Let Ni, . - . , Ng be normal subgroups of a finite group G. If G = N,N; <- Ny 
(notation as in Exercise 25) and |G] = |N,| « |N2| + - > |NjJ, prove that G = 
NiX NX: X Me 


27. Let N, H be subgroups of a group G. G is called the semidirect product of N 
and H if Nis normal in G, G = NH, and NN H = (e). Show that each of the 
following groups is the semidirect product of two of its subgroups: 

@) Sı (b) D, ©) S4 

28. A group G is said to be indecomposable if it is zot the direct product of 
two of its proper normal subgroups. Prove that each of these groups is 
indecomposable: 

@ S (b) DÐ, @Z 

29. If p is prime andn is a positive integer, prove that Z,, is indecomposable. 

30. Prove that Q is an indecomposable group. 

31. Show by example that a homomorphic image of an indecomposable group 
need not be indecomposable. 

32. Prove that a group Gis indecomposable if and only if whenever H and K are 
normal subgroups such that G = H.x K, then H = (e)or K = €). 


33. Let Ibe the set of positive integers and assume that for each iE I, G; is a 
group.* The infinite direct product of the G; is denoted ue G, and consists 


of all sequences (a), @, . . .) with a,€ G,. Prove that IT Gi is a group under the 
coordinatewise operation 


(a, ay,» MA, Bo, .. :) = (4,5), dyb, oe va 


C. 34. With the notation as in Exercise 33, let 2G denote the subset of u G 


consisting of all sequences (c,, ¢2, - . .) EAn that there are at most a SARIE 
number of coordinates with ¢, # e, where e, is the identity element of G, 
Prove that È „Gi is a normal subgroup of IT G; 2 Gi is called the infinite 
direct sum of ‘the G;. 


35. Let G be a group and assume that for each positive integer i, N, isa normal 
ra of G. If every element of G can be written uniquely in the form 
n, * My," °° Rya with i < i < -++ < ig and n, E N, prove that G = > N, (see 
eat 34).? [Hint: Adapt the proof of Theorem 9.1 by dining Kan, Q,..-) 
to be the product of those a, that are not the identity element.] 


36. If (m, n) = 1, prove that U,,, = Un X Up 


*Any infinite index set / may be used here, but the restriction to the positive integers simplifies the 
notation. 


tUniqueness means that if a; --- a, = by: + 6, 


ya With fy <i- < andj, <j <--- <j, then k= t 
andforr=1,2,..,,4:4,=j,and a,b, 
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37. Let H be a group and TH — Gi, T2:H > Gy, ..., T: H —> Ga homomorphisms 
with this property: Whenever G is a group and g):G—> Gi, g2:G—> G;, ..., 
&:G —> G, are homomorphisms, then there exists a unique homomorphism 
g*:G > H such that 7,° g* = g, for every i. Prove that H = G) X G, X ++: X Gy 
[See Exercise 19.] 


92 Finite Abelian Groups 


All finite abelian groups will now be classified. We shall prove that every finite abe- 
lian group G is a direct sum of cyclic subgroups and that the orders of these cyclic 
subgroups are uniquely determined by G. The only prerequisites for the proof other 
than Section 9.1 are basic number theory (Section 1.2) and elementary group theory 
(Chapters 7 and 8, omitting Sections 7.5 and 8.5). 

Following the usual custom with abelian groups, all groups are written in additive 
notation in this section. The following dictionary may be helpful for translating from 
multiplicative to additive notation: 


MULTIPLICATIVE NOTATION ADDITIVE NOTATION 
ab a+b 
e 0 
a& ka 
d=e ka=0 
= {mn|me M,ne N} M+N={m+n|meM,neN} 
direct product M X N direct sum M ® N 
direct factor M direct summand M 


Here is a restatement in additive notation of several earlier results that will be used 
frequently here: 


Theorem 7.9 


Let G be an additive group and letae 6. 


(1) If a hasorder n, then ka = Oif and only ifn | k. 
(3) If a has order td, with d > 0, then fa has order d. W 


Theorem 9.1 


If N,,...,N, are normal subgroups of an additive group G such that every 
element of G can be written uniquely in the form a, + a + --- + a with 
a,EN, thenG =N ON: O- -ON m 
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Theorem 9.3 


If M and N are normal subgroups of an additive group G such that G = M +N 
andMON=(0),thenG=M@ON. m 


Finally we note that Exercise 11 of Section 9.1 will be used without explicit mention 
at several points. 

If Gis an abelian group and p is a prime, then G(p) denotes the set of elements in 
G whose order is some power of p; that is, 


G(p) = {aE G| |a| = p” for some n = 0}. 


It is easy to verify that G(p) is closed under addition and that the inverse of any element 
in G(p) is also in G(p) (Exercise 1). Therefore, G( p) is a subgroup of G. 


EXAMPLE 1 


If G = Z,,, then G(2) is the set of elements having orders 2°, 2', 2%, etc. Verify 
that G(2) is the subgroup {0, 3, 6, 9}; similarly, G(3) = {0, 4, 8}. If G = Z; © Z;, 
then G(3) = Gsince every nonzero element in G has order 3. 


The first step in proving that a finite abelian group G is the direct sum of cyclic 
subgroups is to show that Gis the direct sum of its subgroups G(p), one for each of the 
distinct primes dividing the order of G. In order to do this, we need 


Lemma 9.4 


Let G be an abelian group and aEeG an element of finite order. Then 
a=a,t+at--- +a, witha,EG(p,),wherep;,...,p;arethe distinct positive 
primes that divide the order of a. 


Proof» The proof is by induction on the number of distinct primes that divide the 
order of a. If |a| is divisible only by the single prime p,, then the order of 
ais a power of p, and, hence, a E G(p,). So the lemma is true in this case. 
Assume inductively that the lemma is true for all elements whose order 
is divisible by at most k — 1 distinct primes and that |a| is divisible by the 
distinct primes p;,... , Pa- Then |a| = p,"- - - P”, with each r, > 0. Let 
m = p? ---p,*andn = p}, so that |a| = mn. Then (m, n) = 1 and by 
Theorem 1.2 there are integers u, v such that 1 = mu + nv. Consequently, 


a = la = (mu + nv)a = mua + nva. 


But mua E G(p,) because a has order mn, and, hence, p; (mua) = (nm)ua = 
u(mna) = u0 = 0. Similarly, m(nva) = 0 so that by Theorem 7.9 the order of 
nva divides m, an integer with only k — 1 distinct prime divisors. Therefore, 
by the induction assumption nva = a, + a; + > - - + a, with qE G(p). Let 
a, = mua; then a = mua + nva = a, + a, +--- + ap, witha,EG(p). E 
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Theorem 9.5 


If G is a finite abelian group, then 


G = G(p1) ® Cle) ®--- ® Gy), 


where p;,..., pare the distinct positive primes that divide the order of G. 


Proof» trac G, then its order divides |G| by Corollary 8.6. Hence, a = 
a,+--- + a, with a; E G(p, by Lemma 9.4 (where a, = O if the prime 
P; does not divide |al). To prove that this expression is unique, suppose 
that a; +a, +---+a,=b,+b,+---+ b, witha, b,E Gp). Since Gis 
abelian 


a — by = (b, — a) + (b; — a) +- -+ (b, — a). 


For each i, b; — a,€ G(p,) and, hence, has order a power of p, say p;”. If 
m = p++: p, then m(b, — a) = 0 for i= 2, so that 


m(a, — b) = m(b, — a) + ---+ mb, — 4) =04+---+0=0. 


Consequently, the order of a, — b, must divide m by Theorem 7.9. But 

a, — b, E G(p)), so its order is a power of p,. The only power of p; that 
divides m = p,” --- př is p? = 1. Therefore, a, — b; = 0 and a, = b,- 
Similar arguments for i = 2, . . . , ¢ show that a, = b; for every i. Therefore, 
every element of G can be written uniquely in the form a, + - -- + aş with 
a,€ G(p,) and, hence, G = G(p,) ®--- ® Gp) by Theorem 9.1. W 


If pis a prime, then a group in which every element has order a power of p is called 
a p-group. Each of the G(p,) in Theorem 9.5 is a p-group by its very definition. An 
element a of a p-group B is called an element of maximal order if |b| = |a| for every 
bEB. If |a| = p" and bc B, then b has order p’ with / s n. Since p” = p/p”! we see that 
p'b = p*{p/b) = 0. Hence, 


If a is an element of maximal order p” in a p-group 2, then p*b = 0 for every bE B. 


Note that elements of maximal order always exist in a finite p-group. 

The next step in classifying finite abelian groups is to prove that every finite abelian 
p-group has a cyclic direct summand, after which we will be able to prove that every 
finite abelian p-group is a direct sum of cyclic groups. 


Lemma 9.6 


Let G be a finite abelian p-group and a an element of maximal order in G. Then 
there is a subgroup K of G such that G = @) @ K. 


The following proof is more intricate than most of the proofs earlier in the book. 
Nevertheless, it uses only elementary group theory, so if you read it carefully, you 
shouldn’t have trouble following the argument. 


Copmets E A Lows. AX Sight Bamrend May aot bs copied Erana, or dopficmad, in whale oc in pert Dus to electronic cights, mme third perty couialt may be axppmmad frm the eBook and/or eChagaar(s). Edunriel review hea 
that any Lapprenas coment does oct qanedally affsct the oveli learning anperiance. Cuag age Learning manrves the right to saava midirional contest et any the if sthanqome rights cemrictinns require it. 


292 Chapter9 Topics in Group Theory 


Proof of Lemma 9.6 > Consider those subgroups H of G such that (@ N H = (0). 
There is at least one (H = (0)), and since G is finite, there must be a largest 
subgroup K with this property. Then (a) N K = (0), and by Theorem 9.3 
we need only show that G = (a) + K. If this is zat the case, then there 
is a nonzero b such that b ¢ (a) + K. Let k be the smallest positive inte- 
ger such that p*be (a) + K (there must be one since Gis a p-group and, 
hence, p/b = 0 = 0 + 0€ (a) + K for some positive f). Then 


(1) c=p"b isnotin (j+ K 
and pe = p*D is in (a) + K, say 
(2) pe=tatk (teZ ke K). 


If ahas order p”, then p*x = 0 for all x € G because a has maximal order. 
Consequently, by (2) 


P'ta + pk = ta + k) = p (pe) = p'e = 0. 


Therefore, p*'ta = -pk e(a} N K= (0) and p"~'ta = 0. Theorem 7.9 
shows that p" (the order of a) divides p"~'t, and it follows that p | t, 

say £ = pm. Therefore, pc = ta + k = pma + k, and consequently, 

k = pe — pma = p(c — ma). Let 


(3) d=c-—ma. 


Then pd = p(c — ma) = kE K, but d ¢ K (since c — ma = k' € K would 
imply that c = ma + k' € (a) + K, contradicting (1)). Use Theorem 7.12 
to verify that H = {x + zd | x€ K, zE Z} is a subgroup of G with 

KC H. Since d = 0 + 1d EH and d¢ K, H is larger than K. But K is the 
largest group such that {a} N K = (0), so we must have (a) N H # (0). If w 
is a nonzero element of (#) N H, then 


(4) w=sa=k,+rd (kE K;r,sEZ). 


We claim that p ¥# r; for if r = py, then since pd € K, O + w = sa = k; + 
ypd E (a} N K, a contradiction. Consequently, (p, r) = 1, and by 
Theorem 1.2 there are integers u, v with pu + rv = 1. Then 


c = le = (pu + ruje = ulpe) + v(re) 
= u(ta + k) + o(r(d+ ma)) [by (2) and (3)) 
= u(ta + k) + v(rd + rma) 
= u(ta + k) + v(sa — kı + mma) [by (4)] 
= (ut + vs + rm)a + (uk — vk,)E (a) + K. 


This contradicts (1). Therefore, G = (a) + K, and, hence, G = (a) ® K by 
Theorem 9.3. @ 
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Theorem 9.7 The Fundamental Theorem of Finite 
Abelian Groups 


Every finite abelian group G is the direct sum of cyclic groups, each of prime 
power order. 


Proof» By Theorem 9.5, G is the direct sum of its subgroups G(p), one for each 
prime p that divides |G|. Each G(p) is a p-group. So to complete the 
proof, we need only show that every finite abelian p-group H is a di- 
rect sum of cyclic groups, each of order a power of p. We prove this by 
induction on the order of H. The assertion is true when H has order 2 
by Theorem 8.7. Assume inductively that it is true for all groups whose 
order is less than |H | and let a be an element of maximal order p” in H. 
Then H = @)@ K by Lemma 9.6. By induction, K is a direct sum of 
cyclic groups, each with order a power of p. Therefore, the same is true 
of H=(@)@K. E 


EXAMPLE 2 


The mamber 36 can be written as a product of prime powers in just four 

ways: 36 =2+2+3+3=2:2:+37=2?+3-3 = 2. 3%, Consequently, by 
Theorem 9.7 every abelian group of order 36 must be isomorphic to one of the 
following groups: 


1,04,0404, £0404 4064:04, 404. 


You can easily verify that no two of these groups are isomorphic (the number 
of elements of order 2 or 3 is different for each group). Thus we have a com- 
plete classification of all abelian groups of order 36 up to isomorphism. 


You probably noticed that a familiar group of order 36, namely 24, doesn’t appear 
explicitly on the list in the preceding example. However, it is isomorphic to 7, ® Zo, 
as we now prove. 


Lemma 9.8 
lf (m, k) = 1, then Zm ®© Zk = Ling 


Proof » The order of (1, 1) in Z,, ® Zz is the smallest positive integer £ such that 
(0, 0) = (1, 1) = (t, À. Thus £ = 0 (mod m) and ¢ = 0 (mod k), so that 
m |t and k| t. But (m, k) = 1 implies that mk | t by Exercise 17 in 
Section 1.2. Hence, mk ss t. Since m1, 1) = (mk, mk) = (0, 0) and 
tis the smallest positive integer with this property, we must have mk = 
t = |(1, D|. Therefore, Z,, ® Z; (a group of order mk) is the cyclic group 
generated by (1, 1) and, hence, is isomorphic to Z, by Theorem 7.19. E 
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Theorem 9,9 


fn = pp +. pf, with p, ..., pi distinct primes, then 
Z, = Zp D: s OL om 


Proof The theorem is true for groups of order 2. Assume inductively that it 
is true for groups of order less than x. Apply Lemma 9.8 with m = p,” 
and k = p,” - -~ p". Then Z, = Z,a ® Ze and the induction hypothesis 
shows that Z= Z ® OZ, m 


Combining Theorems 9.7 and 9.9 yields a second way of expressing a finite abelian 
group as a direct sum of cyclic groups. 


EXAMPLE 3 


Consider the group 
G=2,02,0402,04,02:02, 02,02. 


Arrange the prime power orders of the cyclic factors by size, with one row for 
each prime: 


2 2 22 z2 
3 3 3 
5 5? 


Now rearrange the cyclic factors of G using the columns of this array as a guide 
(see Exercise 15 of Section 9.1) and apply Theorem 9.9: 


G = (2) © (LO Z) OZO2,02) © (Z OL ®© Za) 
46-20 &£ © Zo E Zæ 


This last decomposition of G as a sum of cyclic groups is sometimes more 
convenient than the original prime power decomposition: There are fewer 
cyclic factors, and the order of each cyclic factor divides the order of the next 
one. Although the notation is a bit more involved, the same process works in 
the general case and proves the following Theorem. 


Theorem 9,10 


Every finite abelian group is the direct sum of cyclic groups of orders 
My, Moyo 6+, My Where M, | Mo, Mg | Mg, M3 | May 6 ANG Mei | Me 


We pause briefly here to present an interesting corollary that will be used in 
Chapter 11. A version of it was proved earlier as Theorem 7.16. 
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Corollary 9.11 


If G is a finite subgroup of the multiplicative group of nonzero elements of a 
field F, then G is cyclic.* 


Proof» since Gis a finite abelian group, Theorem 9.10 implies that 
G = Zm, ® ` -+ ® Zn, where each m, divides m,. Every element b in 
Zm D- © Lm, satisfies mb = 0 (Why?). Consequently, every element 
g of the multiplicative group G must satisfy g™ = 1, (that is, must be a 
solution of the equation x” — 1p = 0). Since G has order mm, + > + m, 
and x” — 1p = 0 has at most m, distinct solutions in F by Corollary 4.17, 
we musthave t = land G= Zm, E 


If G is a finite abelian group, then the integers m, . . . , m, in Theorem 9.10 are 
called the invariant factors of G. When G is written as a direct sum of cyclic groups 
of prime power orders, as in Theorem 9.7, the prime powers are called the elementary 
divisors of G. Theorems 9.7 and 9.10 show that the order of G is the product of its 
elementary divisors and also the product of its invariant factors. 


EXAMPLE 4 


All abelian groups of order 36 can be classified up to isomorphism in terms 
of their elementary divisors (as in Example 2) or in terms of their invariant 
factors (using the procedure in Example 3): 


ELEMENTARY INVARIANT ISOMORPHIC 


GROUP DIVISORS FACTORS GROUP 
4,04:,04,04, 2, 2, 3,3 6, 6 £60 Ze 
204,04, 2,2, 3° 2, 18 2,024. 
4,.04,0 Z, ?,3,3 3, 12 2,0 Znz 
Z,@ Zs 2,3 36 246 


The Fundamental Theorem 9.7 can be used to obtain a list of all possible abelian 
groups of a given order. To complete the classification of such groups, we must show 
that no two groups on the list are isomorphic, that is, that the elementary divisors of a 
group are uniquely determined.‘ 


Theorem 9.12 


Let G and H be finite abelian groups. Then G is isomorphic to H if and only if 
G and H have the same elementary divisors. 


*If you have not read Sections 3.1 and 4.4, skip this corollary until you have. 


tThe remainder of this section is optional. Theorem 9.12 is often considered to be part of the 
Fundamental Theorem of Finite Abelian Groups. 
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It is also true that G = Hif and only if G and H have the same invariant factors 
(Exercise 24). 


Proof of Theorem 9.12 » 1f Gand H have the same elementary divisors, then both G 
and H are isomorphic to the same direct sum of cyclic groups and, hence, 
are isomorphic to each other. Conversely, if fG — H is an isomorphism, 
then a and f(a) have the same order for each a € G. It follows that for 
each prime p, f(G(p)) = H(p) and, hence, G(p) = H(p). The elementary 
divisors of G that are powers of the prime p are precisely the elementary 
divisors of G(p), and similarly for H. So we need only prove that isomor- 
phic p-groups have the same elementary divisors, In other words, we need 
to prove this half of the theorem only when G and H are p-groups. 

Assume G and H are isomorphic p-groups. We use induction on the 
order of G to prove that G and H have the same elementary divisors. 
All groups of order 2 obviously have the same elementary divisor, 2, by 
Theorem 8.7. So assume that the statement is true for all groups of order 
less than |G|. Suppose that the elementary divisors of G are 


P”, p”, -s P% Ds Ps ++ oP witha; =n, =*+-2n,> 1 


r copies 


and that the elementary divisors of H are 


PEL ie ang Pt, Pepys =P with m = m =`» =m > 1. 
p 
5S copies 

Verify that pG = {px| xE G} is a subgroup of G (Exercise 2). If Gis the 
direct sum of groups C, verify that pG is the direct sum of the groups pC, 
(Exercise 4). If C; is cyclic with generator a of order p”, then pC, is the cyclic 
group generated by pa. Since pa has order p"! by part (3) of Theorem 7.9, 
PC; is cyclic of order p™'. Note that when n = 1 (that is, when G is cyclic of 
order p), then pC, = (0). Consequently, the elementary divisors of pG are 


POOP cag hh 


A similar argument shows that the elementary divisors of pH are 


peg, saaa PAT 

If f:G— His an isomorphism, verify that {(pG) = pH so that pG = pH. 
Furthermore, pG + G (Exercise 9), so that |pG|< |G| Hence pG and pH 
have the same elementary divisors by the induction hypothesis; that is, 
t=kand 

preges so that n — 1=m,-1fori=1,2,...,4. 
Therefore, n, = m, for each i. So the only possible difference in elemen- 
tary divisors of Gand H is the number of copies of p that appear on 
each list. Since |G] is the product of its elementary divisors, and similarly 
for |H], and since G = H, we have 
Pry Py G= E= pp pp. 

Since m, = n, for each i, we must have p” = p‘ and, hence, r = s. Thus G 
and H have the same elementary divisors. W 
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E Exercises 


NOTE: All groups are written additively, and p always denotes a positive prime, unless 
noted otherwise. 


A. 1. If Gis an abelian group, prove that G( p) is a subgroup. 
2. If Gis an abelian group, prove that pG = {px | xE G} is a subgroup of G. 
3. List all abelian groups (up to isomorphism) of the given order: 
(a) 12 (b) 15  (e)30 (d) 72 
(e) 90 (f) 144g) 600 h) 1160 
4. If Gand G,(1 =£ i = n) are abelian groups such that G = G ®: +: @ G,, 
show that pG = pG ®::+ ® PG, 
5. Find the elementary divisors of the given group: 
(a) Zz% (b) Z; ® Zn ® Zis 
(c) Zo ® Zn ® Zs ® Zao (d) Z ® Zz © Zio © Zuo 
6. Find the invariant factors of each of the groups in Exercise 5. 


B. 7. Find the elementary divisors and the invanant factors of the given group. Note 
that the group operation is multiplication in the first three and addition in the last. 
(a) Us (b)Un (@ Us (d) MD) 
8. If G is the additive group Q/Z, what are the elements of the subgroup G(2)? 
Of G{p) for any positive prime p? 
9. (a) If Gisa finite abelian p-group, prove that pG # G. 
(b) Show that part (a) may be false if G is infinite. [Hint: Consider the group 
G(2) in Exercise 8.] 
10. If Gis an abelian p-group and (n, p) = | prove that the map f: G > G given by 
f(a) = na isan isomorphism. 
11. If Gisa finite abelian p-group such that pG = {0}, prove that G=Z,@---@Z, 
for some finite number of copies of Z,- 


12. (Cauchy’s Theorem for Abelian Groups) If Gis a finite abelian group and p is 
a prime that divides |G|, prove that Gcontains an element of order p. 
[Hint: Use the Fundamental Theorem to show that G has a cyclic subgroup 
of order p*; use Theorem 7.9 to find an element of order p.] 


13. Prove that a finite abelian p-group has order a power of p. 


14. If Gis an abelian group of order p'm, with (p, m) = 1, prove that G(p) has 
order p’. 


15. If Gis a finite abelian group and p is a prime such that p” divides |G}, then 
prove that G has a subgroup of order p”. 


16. For which positive integers n is there exactly one abelian group of order n (up 
to isomorphism)? 
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17. Let G, H, K be finite abelian groups. 
(a) If G ® G = H ® H, prove that G = H. 
(b) If G® H = G @® K, prove that H = K. 


18. If Gis an abelian group of order n and k | n, prove that there exist a group H 
of order k and a surjective homomorphism G —> H. 


19. Let G be an abelian group and T the set of elements of finite order in G. Prove 
that 


(a) T is a subgroup of G (called the torsion subgroup). 
(b) Every nonzero element of the quotient group G/T has infinite order. 


20. If G is an abelian group, do the elements of infinite order in G (together with 
0) form a subgroup? [Hint: Consider Z ® Z;.] 


C. 21. If Gis an abelian groupand f:G — Z a surjective homomorphism with kernel 
K, prove that G has a subgroup H such that H = Z and G = K@ H. 


22. Let G and H be finiteabelian groups with this property: For each positive 
integer m the number of elements of order m in G is the same as the number 
of elements of order m in H. Prove that G = H. 


23. Let G be finite abelian group with this property: For each positive integer 
m such that m | |G], there are exactly m elements in G with order dividing m. 
Prove that G is cyclic. 


24. Let G and H be finite abelian groups. Prove that G = H if and only if G and H 
have the same invariant factors. 


25. If Gis an infinite abelian torsion group (meaning that every element in G has 
finite order), prove that G is the infinite direct sum È G(p), where the sum is 
taken over all positive primes p. [Hint: See Exercises 34 and 35 in Section 9.1 
and adapt the proof of Theorem 9.5.] 


9.3 The Sylow Theorems 


Nonabelian finite groups are vastly more complicated than finite abelian groups, 
which were classified in the last section. The Sylow Theorems are the first basic step 
in understanding the structure of nonabelian finite groups. Since the proofs of these 
theorems are largely unrelated to the way the theorems are actually used to analyze 
groups, the proofs will be postponed to the next section.* In this section we shall try 
to give you a sound understanding of the meaning of the Sylow Theorems and some 
examples of their applications. 

Throughout the general discussion in this section a// groups are written multiplica- 
tively and all integers are assumed to be nonnegative. 


*Puritans who believe that the work musf come before the fun should read Section 9.4 before 
proceeding further. 
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Once again the major theme is the close connection between the structure of 
a group G and the arithmetical properties of the integer |G|. One of the most im- 
portant results of this sort is Lagrange’s Theorem, which states that if G has a 
subgroup H, then the integer |H] divides |G|. The First Sylow Theorem provides a 
partial converse: 


Theorem 9.13 First Sylow Theorem 


Let G be a finite group. If p is a prime and p* divides |G|, then G has a 
subgroup of order p". E 


EXAMPLE 1 


The symmetric group S, has order 6! = 720 = 2‘ + 3? + 5. The First Sylow 
Theorem (with p = 2) guarantees that S, has subgroups of orders 2, 4, 8, and 
16. There may well be more than one subgroup of each of these orders. For 
instance, there are at least 60 subgroups of order 4 (Exercise 1). Applying the 
theorem with p = 3 shows that Ss has subgroups of orders 3 and 9. Similarly, 
Se has at least one subgroup of order 5. 


If p is a prime that divides the order of a group G, then G contains a subgroup K 
of order p by the First Sylow Theorem. Since K is cyclic by Theorem 8.7, its generator 
is an element of order p in G. This proves 


Corollary 9.14 Cauchy’s Theorem 


If G is a finite group whose order is divisible by a prime p, then G contains an 
element of order p. E 


Let G be a finite group and pa prime. If p” is the largest power of p that divides |G|, 
then a subgroup of G of order p" is called a Sylow p-subgroup. The existence of Sylow 
p-subgroups is an immediate consequence of the First Sylow Theorem. 


EXAMPLE 2 


Since S4 has order 4! = 24 = 23 - 3, every subgroup of order 8 is a Sylow 
2-subgroup. You can readily verify that 


{(1), (1234), (13)(24), (1432), (24), (12)(34), (13), (14)(32)} 


is a subgroup of order 8 and, hence, a Sylow 2-subgroup. There are two other 
Sylow 2-subgroups (Exercise 2). Any subgroup of S, of order 3 is a Sylow 
3-subgroup. Two of the four Sylow 3-subgroups are {(123), (132), (1)} and 
{(134), (143), (1)}. 
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EXAMPLE 3* 


Let p bea prime and Ga finite abelian group of order p*m, where p y m. Then 
G(p) = {a€ G| la| = p* for some k = 0} 


is a Sylow p-subgroup of G since G(p) has order p" by Exercise 14 of Section 9.2. 
As we shall see, G(p) is the unique Sylow p-subgroup of G. Theorem 9.5 shows 
that G is the direct sum of all its Sylow subgroups (one for each of the distinct 
primes that divide |G). 


Let G be a group and x € G. Example 9 of Section 7.4 shows that the map f: G + G 
given by f(a) = x~'ax is an isomorphism. If K is a subgroup of G, then the image of K 
under fis x-'Kx = {x~'kx|k eK}. Hence, x~! Kx is a subgroup of G that is isomorphic 
to K. In particular, x-'Kx has the same order as K. Consequently, 


if K is a Sylow p-subgroup of G, then so is x~'Kx. 


The next theorem shows that every Sylow p-subgroup of G can be obtained from K in 
this fashion. 


Theorem 9.15 Second Sylow Theorem 


If P and K are Sylow p-subgroups of a group G, then there exists xe G such 
that P =x-'Kxy. m 


Theorem 9.15, together with the italicized statement in the preceding paragraph, 
shows that 


any two Sylow p-subgroups of G are isomorphic. 


Corollary 9.16 


Let G be a finite group and K a Sylow p-subgroup for some prime p. Then K is 
normal in G if and only if K is the only Sylow p-subgroup in G. 


Proof» We know that x-!Kx is a Sylow p-subgroup for every x EG. If K is the 
only Sylow p-subgroup of G, then we must have x~!Kx = K for every 
xE G. Therefore, K is normal by Theorem 8.11. Conversely, suppose 
K is normal and let P be any Sylow p-subgroup. By the Second Sylow 
Theorem there exists x € G such that P = x"'Kx. Since K is normal, 
P =x~'Kx = K. Therefore, K is the unique Sylow p-subgroup. E 


*Skip this example if you haven't read Section 9.2. 
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The preceding theorems establish the existence of Sylow p-subgroups and the rela- 
tionship between any two such subgroups. The next theorem tells us how many Sylow 
p-subgroups a given group may have. 


Theorem 9.17 Third Sylow Theorem 


The number of Sylow p-subgroups of a finite group G divides |G | and is of the 
form 1 + pk for some nonnegative integer k. 


Applications of the Sylow Theorems 


Simple groups (those with no proper normal subgroups) are the basic building blocks 
for all groups. So it is useful to be able to tell if there are any simple groups of a partic- 
ular order. The Third Sylow Theorem, together with appropriate counting arguments 
and Corollary 9.16, can often be used to establish the existence of a proper normal 
subgroup of a group G, thus showing that G is not simple. 


EXAMPLE 4 


If Gis a group of order 63 = 3? - 7, then each Sylow 7-subgroup has order 7 and 
the number of such subgroups is a divisor of 63 of the form 1 + 7k by the Third 
Sylow Theorem. The divisors of 63 are 1, 3, 7, 9, 21, 63 and the numbers of the 
form 1 + 7k (with k = 0) are 1, 8, 15, 22, 29, 36, 43, 50, 57, 64, etc. Since 1 is the 
only number on both lists, Ghas exactly one Sylow 7-subgroup. This subgroup is 
normal by Corollary 9.16. Consequently, no group of order 63 is simple. 


EXAMPLE 5 


We shall show that there is no simple group of order 56 = 2° - 7. The only 
divisors of 56 of the form 1 + 7k are 1 and 8. So G has either one or eight 
Sylow 7-subgroups, each of order 7. If there is just one Sylow 7-group, it 

has to be normal by Corollary 9.16. So Gis not simple in that case. If G has 
eight Sylow 7-groups, then each of them has six nonidentity elements, and 
each nonidentity element has order 7 by Corollary 8.6. Furthermore, the 
intersection of any two of these subgroups is (e) by Exercise 21 of Section 8.1. 
Consequently, there are 8 : 6 = 48 elements of order 7 in G. Every Sylow 
2-subgroup of Ghas order 8. Each element of a Sylow 2-subgroup must have 
order dividing 8 by Corollary 8.6 and, therefore, cannot be in the set of 48 
elements of order 7. Thus there is room in G for only one group of order 8. 
In this case, therefore, the single Sylow 2-subgroup of order 8 is normal by 
Corollary 9.16, and G is not simple. 


In the preceding examples, the Sylow Theorems were used to reach a negative con- 
clusion (the group is not simple). But the same techniques can also lead to positive 
results. In particular, they allow us to classify certain finite groups. 
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Corollary 9.18 
Let G be a group of order pg, where p and q are primes such that p > q. If 
gq x (p — 1), then G = Zop 


Proof» By the Third Sylow Theorem, the number of Sylow p-subgroups must divide 
|G| = p¢, and hence, must be one of 1, p, q, or pg. However, the mmber 
must also be of the form 1 + pk for some integer k. Since p > q, we cannot 
have g = 1 + pk. Furthermore, both p = 1 + pk and pq = 1 + pk imply that 
P | 1, which is impossible. Therefore, there is exactly one Sylow p-subgroup 
H of order p, which is normal by Corollary 9.16. A similar argument (using 
the fact that q + (p — 1)) shows that there is a unique Sylow gsubgroup K 
of order q, which is also normal. Since H N K is a subgroup of both H and 
K, its order must divide both |H] = p and |K| = q by Lagrange’s Theorem. 
Hence, H N K = (e). Exercise 15 shows that G = HK. Therefore, 

G = H X K by Theorem 9.3. But H = Z, and K = Z, by Theorem 8.7. 
Consequently, by Lemma 9.8, G = H X K = Z, XZ,=7,,.* 


EXAMPLE 6 


It is now easy to classify all groups of order 15 = 5 + 3. Apply Corollary 9.18 
with p = 5, q = 3 to conclude that every group of order 15 is isomorphic to Zs. 
Similarly, there is a single group (up to isomorphism) for each of these orders: 
33 = 11-3,35=7-5,65 = 13 » 5,77 = 11 - 7,and 91 = 13-7. 


Other applications of the Sylow Theorems are given in Section 9.5. 


E Exercises 


NOTE: Unless stated otherwise, G is a finite group and p is a positive prime. 


A. 1. Show that S; has at least 60 subgroups of order 4. [Hint: Consider cyclic 
subgroups generated by a 4-cycle (such as ((1234))) or by the product of 
a 4-cycle and a disjoint transposition (such as ((1234)(56))); also look at 
noncyclic subgroups, such as {(1), (12), (34), (12)(34)}.] 


2. (a) List three Sylow 2-subgroups of S4- 
(b) List four Sylow 3-subgroups of S4. 
3. List the Sylow 2-subgroups and Sylow 3-subgroups of 44. 


4. List the Sylow 2-subgroups, Sylow 3-subgroups, and Sylow 5-subgroups of 
Ziz X Zn X Zio. [Section 9.2 is a prerequisite for this exercise] 


*The proof of Lemma 9.8 is independent of the rest of Section 9.2 and may be read now if you skipped 
that section. 
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5. How many Sylow p-subgroups can G possibly have when 
(a) p = 3 and |G| =72 (b) p = Sand |G| = 60 
6. Classify all groups of the given order: 


(a) 115 (b) 143 (c) 391 
7. Prove that there are no simple groups of the given order: 
(a) 42 (b) 200 (c) 231 (d) 255 


B. 8. Use Cauchy’s Theorem to prove that a finite p-group has order p” for some n = 0. 


9. If N is anormal subgroup of a (not necessarily finite) group G and both N 
and G/N are p-groups, then prove that G is a p-group. 


10. If His a normal subgroup of Gand |H| = p*, show that H is contained in 
every Sylow p-subgroup of G. [You may assume Exercise 24 in Section 9.4.] 


11. If fis an automorphism of Gand K is a Sylow p-subgroup of G, is it true that 
S(K) = K? 

12. Let K be a Sylow p-subgroup of Gand H any subgroup of G. Is KN Ha 
Sylow p-subgroup of H? [Hint: Consider S,.] 


13. If every Sylow subgroup of G is normal, prove that G is the direct product of 
its Sylow subgroups (one for each prime that divides |G[). A group with this 
property is said to be nilpotent. 


14. If p is prime, prove that there are no simple groups of order 2p. 


15. (a) If Hand Kare subgroups of G, then HK denotes the set 
{hk © G| hE H,keEK}. If HN K = (e), prove that |HK| = |H| |K|. 
[Hint: If hk = hk, then Ay'h = kk] 


(b) If H and Kare any subgroups of G, prove that 
| H| +|K| 

AK| = ——. 

ee] |HNK| 


16. If Gis a group of order 60 that has a normal Sylow 3-subgroup, prove that 
G also has a normal Sylow 5-subgroup. 


17. If Gis anoncyclic group of order 21, how many Sylow 3-subgroups does 
G have? 


18. If Gis a simple group of order 168, how many Sylow 7-subgroups does 
G have? 


19. If p and q are distinct primes, prove that there are no simple groups of order pq. 
20. If G has order p*m with m < p, prove that G is not simple. 
21. Prove that there are no simple groups of order 30. 
22. If p and q are distinct primes, prove that there is no simple group of order p’q. 
23. (a) If |G| = 105, prove that G has a subgroup of order 35. 

(b) If |G] = 375, prove that G has a subgroup of order 15. 
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24. Let K be a Sylow p-subgroup of Gand N a normal subgroup of G. Prove that 
K N Nisa Sylow p-subgroup of N. 


C. 25. If p, q, r are primes with p < q < r, prove that a group of order pqr has a 
normal Sylow r-subgroup and, hence, is not simple. 


9.4 | Conjugacy and the Proof of the Sylow Theorems 


Appendix D (Equivalence Relations) is a prerequisite for this section. The proofs of the 
Sylow Theorems depend heavily on the concept of conjugacy, which we now develop. 
Let G be a group and a, be G. We say that a is conjugate to b if there exists x€ G 
such that b = x` 'ax. For example, (12) is conjugate to (13) in S, because 
(123)~'(12)(123) = (132)(12)(123) = (13). 
The key fact about conjugationis 


Theorem 9,19 


Conjugacy is an equivalence relation on G. 


Proof» We write a ~b if ais conjugate to b. Reflexive. a ~ a since a = eae = ¢ ‘ae. 
Symmetric: If a ~ b, then b = x7'ax for some x in G. Multiplying on the 
left by x and on the right by x~! shows that a = xbx7! = (x7'-'bx7), 
Hence, b ~ a. Transitive: If a ~ band b ~ c, then b = x~'ax and c = y' 
for some x, y EG. Hence, ¢ = y(x axy = (yx!) axy) = CPA). 
Thus a ~ c; therefore, ~ is an equivalence relation. W 


The equivalence classes in G under the relation of conjugacy are called conjugacy 
classes. The discussion of equivalence relations in Appendix D shows that 


The conjugacy class of an element a consists of all the elements in G that are 
conjugate to a. 

Two conjugacy classes are either disjoint or identical. 

The group G is the union of its distinct conjugacy classes. 


EXAMPLE 1 


The conjugacy class of (12) in S, consists of all elements x(12)x, with xE S3. 
A straightforward computation shows that for any x € Sy, x-'(12)x is one of 
(12), (13), or (23); for instance, 
(23) '(12)(23) = (23)(12)(23) = (13) 
(132)-1(12)(132) = (123)(12)(132) = (23). 
Thus the conjugacy class of (12) is {(12), (13), (23)}. Similar computations show 
that there are three distinct conjugacy classes in S,: 


KD}  {(123), (132)}  4(12), (13), (23)}. 
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Although these conjugacy classes are of different sizes, note that the number of 
elements in any conjugacy class (1, 2, or 3) is a divisor of 6, the order of S;. We 
shall see that this phenomenon occurs in the general case as well. 


Let G be a group and a E G. The centralizer of a is denoted C(a) and consists of all 
elements in G that commute with a, that is, 


Ca) = {gEG| ga = ag}. 


If G = S, and a = (123), for example, you can readily verify that C(a) = 
{(1), (123), (132)} and that C(a) is a subgroup of S,. If a is a nonzero rational number 
in the multiplicative group Q*, every element of O* commutes with a, so C(a) is the 
entire group Q*. These examples are illustrations of 


Theorem 9,20 


If G isa group and ae G, then C(a) is a subgroup of G. 


Proof since ea = ae, we have e E C(a), so that C(a) is nonempty. If g, h € C(a), 
then 


(gh)a = g(ha) = g(ah) = (ga)h = (ag)h = a(gh). 


So gh E C(a), and C(a) is closed. Multiplying ga = ag on both the left 
and right by g`" shows that ag~! = g—!a. Hence, g € C(a) implies that 
g 'e€ C(a). Therefore, C(a) is a subgroup by Theorem 7.11. Em 


The centralizer leads to a very useful fact about the size of conjugacy classes: 


Theorem 9,21 


Let G be a finite group and a e G. The number of elements in the conjugacy 
class of a is the index [G:C(a)}] and this number divides |G]. 


Proof» For notational convenience, we shall sometimes denote C(a) by C in this 
proof. Let S be the set of distinct right cosets of C in G, and let T be the 
conjugacy class of a in G (which consists of the distinct conjugates of a). 
Define a function f:S > T by the rule: f(Cx) = x~'ax. We shall show 
below that f is a well-defined bijection of sets, which means that S and 
T have the same number of elements. The number of elements in S is 
the number of distinct right cosets of C(a), namely [G:C(a)], and the 
number of elements in T is the number of distinct conjugates of a. This 
proves the first part of the theorem. As for the final part, the number 
[G:C(a)] divides |G] by Lagrange’s Theorem 8.5. 
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Now for the details: Reading each of the following “if and only if” 
statements in the direction = shows that f is well defined (meaning that 


Cx = Cy implies f(Cx) = fC Cy)): 


Cx = Cy xy EC [Theorem 8.2] 
+> (xy Ja = a(xy') [Definition of C] 
s a= (xy) axy [Lefi multiply by Gy") 
+a = yx axy” [Corollary 7.6] 
+ y lay = x ax [Left multiply by y`! and 
right multiply by y.] 
<= f(Cy) = fC Cx) [Definition of f] 


Reading these same statements in the direction = from bottom to top 
shows that /(Cx) = fi (Cy) implies Cx = Cy, so that fis injective.* Finally, 
fis surjective because, given any conjugate u~1au of a, it is the image of 
the coset Cu. Therefore, fis bijective and the proof is complete. Wi 


Let G be a finite group and let Cj, Cz.. . . , C, be the distinct conjugacy classes of G. 
Then G = Ci U C, U --- U C, Since distinct conjugacy classes are mutually disjoint, 
(1) IG)=|QQUGU+*-UC)=(ICI1+IGi+°°: + G, 
where |C;| denotes the number of elements in the class C; Now choose one element, 
say a;, in each class C, Then G consists of all the conjugates of a, By Theorem 9.21, 
IC] is precisely [G:C(a)], a divisor of |G]. So equation (1) becomes 
(2) IGI = [G:C(a,)] + [(G:C(@,)] + + + > + [G:C(a))). 

This equation (in either version (1) or (2)) is called the class equation of the group G. It 


will be the basic tool for proving the Sylow Theorems. Other applications of the class 
equation are discussed in Section 9.5. 


EXAMPLE 2 


In Example 1 we saw that S} has three distinct conjugacy classes of sizes 1, 2, 
and 3. Since |S;| = 6, the class equation of S, is 6 = 1 +2 + 3. 


If c and x are elements of a group G, then ex = xc if and only if x-'ex = c. Thus c is 
in the center of G [cx = xc for every x € G)if and only if c has exactly one conjugate, itself 
[x~ tex = c for every x € G]. Therefore, the center Z(G) of Gis the union of all the one- 
element conjugacy classes of G, so that the class equation can be wnitten in a third form: 


(3) IGI = |2(G)| + IG) + IGI + +++ + IG, 


where G, ..., C, are the distinct conjugacy classes of G that contain more than one 
element each and each |C} divides |G]. 

In addition to the class equation, one more result is needed for the proof of the 
Sylow Theorems. 


*The reasons in the right-hand column above must be adjusted when reading from bottom to top 
(Exercise 4). 
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Lemma 9.22 Cauchy’s Theorem for Abelian Groups 


If G is a finite abelian group and p is a prime that divides the order of G, then 
G contains an element of order p. 


The lemma is an immediate consequence of the Fundamental Theorem of Abelian 
Groups (Exercise 12 in Section 9.2). The following proof, however, depends only on 
Chapters 7 and 8. 


Proof of Lemma 9.22 > The proof is by induction on the order of G, using the 
Principle of Complete Induction.* To do this, we must first show that 
the theorem is true when |G] = 2. In this case, if p divides |G], then p = 2. 
The nonidentity element of G must have order 2 by part (1) of 
Corollary 8.6, and so the theorem is true. 

Now assume that the theorem is true for all abelian groups of order 
less than n and suppose |G] = n. Let a be any nonidentity element of 
G. Then the order of a is a positive integer and is therefore divisible by 
some prime g (Theorem 1.8), say |a| = gt. The element b = a‘ has order 
q by Theorem 7.9. If q = p, the theorem is proved. If q + p, let N be the 
cyclic subgroup (b). N is normal since G is abelian and N has order q by 
Theorem 7.15. By Theorem 8.13 the quotient group G/Nhas order 
IGVIN| = n/q < n. Consequently, by the induction hypothesis, the theorem 
is true for G/N. The prime p divides |G], and |G| = |N||G/N| = ¢|G/M. 
Since g is a prime other than p, p must divide |G/M by Theorem 1.5. 
Therefore, G/N contains an element of order p, say Ne. Since Ne has 
order p in G/N, we have Nc? =(Nc)? = Ne and, hence, ce” E N. Since N 
has order q, c?? = (cP) = e by part (2) of Corollary 8.6. 

Therefore, c must have order dividing pg by Theorem 7.9. However, 
c cannot have order 1 because then Ne would have order 1 instead of p 
in G/N. Nor can ¢ have order q because then (Ne)? = Nc! = Ne in G/N, 
so that p (the order of Nc) would divide q by Theorem 7.9. The only 
possibility is that c has order p or pq; in the latter case, cf has order p by 
Theorem 7.9. In either case, G contains an element of order p. Therefore, 
the theorem is true for abelian groups of order n and, hence, by induc- 
tion for all finite abelian groups. E 


Proofs of the Sylow Theorems 


We now have all the tools needed to prove the Sylow Theorems. 


Proof of the First Sylow Theorem 9.13» The proof is by induction on the order 
of G. If |G] = 1, then p° is the only prime power that divides |G], and G 
itself is a subgroup of order p’. Suppose |G] > 1 and assume inductively 
that the theorem is true for all groups of order less than |G]. Combining 
the second and third forms of the class equation of G shows that 


IG] = |Z(G)| + [G:C(@)] + [G:C@)] +--+ + [G:C@)], 


“See Appendix C. 
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where for each i, [G:C(a,] > 1. Furthermore, |Z(G)| = 1 (since e € Z(G)), 
and |C(a)| < |G| (otherwise, [G:C(@)] = 1). 

Suppose there is an index j such that p does not divide [G:C(a,)]. Then 
by Theorem 1.5 p* must divide |C(a)| because PF divides |G| by hypothesis 
and |G] = |C(@,)| - [G:C(a,] by Lagrange’s Theorem. Since the subgroup 
C(a,) has order less than |G], the induction hypothesis implies that C(a,), 
and, hence, G has a subgroup of order p*. 

On the other hand, if p divides [G:C(@;)] for every i, then since p 
divides |G|, p must also divide |G| — [G:C(a,)] — -- > — [G:C@)] = 
|Z(G)|. Since Z(G) is abelian, Z(G) contains an element ¢ of order p by 
Lemma 9.22. Let N be the cyclic subgroup generated by c. Then N has 
order p and is normal in G (Exercise 8). Consequently, the order of the 
quotient group G/N, namely |G|/p, is less than |G] and divisible by po, 
By the induction hypothesis G/N has a subgroup T of order p*!. There 
is a subgroup H of G such that N C H and T = H/N by Theorem 8.24. 
Lagrange’s Theorem shows that 


(| = INI + (H/M = M + IT] = ppt! = pt. 
So G has a subgroup of order p* in this case, too. I 
The basictools needed to prove the last two Sylow Theorems are very similar to those 
used above, except that we will now deal with conjugate subgroups rather than conjugate 


elements. More precisely, let H be a fixed subgroup of a group Gand let A and B be any 
subgroups of G. We say that 4 is H-conjugate to B if there exists an x€ H such that 


B=x"'Ax = {x7 ax|aeA}. 


In the special case when H is the group G itself, we simply say that A is conjugate to B, 
or that B is a conjugate of A. 


Theorem 9,23 


Let H be a subgroup of a group G. Then H-conjugacy is an equivalence rela- 
tion on the set of all subgroups of G. 


Proof» Copy the proof of Theorem 9.19, using subgroups A, B, C in place of 
elements a,b,c. E 


Let A be a subgroup of a group G. The normalizer of A is the set N(A) 
defined by 
N(A) = {gEG| "dg = A}. 


Theorem 9,24 


If A is a subgroup of a group G, then NA) is a subgroup of G and A is anormal 
subgroup of N(A). 
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Proof» Exercise 7 shows that 4 c M(A) and that g E€ N(A) if and only if Ag = gA. 
Using this fact, the proof of Theorem 9.20 can be readily adapted to prove 
that N(A) is a subgroup. The definition of N(A) shows that 4 is normal 
in N(A). E 


Theorem 9.25 


Let H and A be subgroups of a finite group G. The number of distinct 
H-conjugates of A (that is, the number of elements in the equivalence class 
of A under H-conjugacy) is [H:H N N(A)] and, therefore, divides |H|. 


Proof» The proof of Theorem 9.21 carries over to the present situation if you 
replace G by H, aby A, and C by HN NA). B 


Lemma 9.26 


Let Q be a Sylow p-subgroup of a finite group G. If x EG has order a power of 
p and x~'Qx = Q, then xe. 


Proof» since Q is normal in N(Q) by Theorem 9.24, the quotient group M(Q)/Q is 
defined. By hypothesis, x € M(Q). Since |x| is some power of p, the coset 
Qx in N(Q)/Q also has order a power of p. Now Qx generates a cyclic 
subgroup T of N(Q)/Q whose order is a power of p. By Theorem 8.24, 
T = A/Q, where H is a subgroup of G that contains Q. Since the 
orders of the groups Q and T are each powers of p and |H| = |Q| : |7 | 
by Lagrange’s Theorem, |H| must be a power of p. But Q C H, and |Q] 
is the largest power of p that divides |G] by the definition of a Sylow 
p-subgroup. Therefore, Q = H, and, hence, T = H/Q is the identity 
subgroup. So the generator Qx of T must be the identity coset Qe. The 
equality Qx = Qe implies that xE Q. E 


Proof of the Second Sylow Theorem 9.15 > Since K is a Sylow p-subgroup, K has 
order p”, where |G] = p’m and p + m. Let K = K}, K2, . - . , K, be the dis- 
tinct conjugates of K in G. By Theorem 9.25 (with H = G and K = A), 

t = [G:N(K)]. Note that p does not divide t [reason: f'm = |G| = 
IN(K)| = [G:N(K)] = |N(K)| + t and p" divides |N(K)| because K is a subgroup 
of N(K)]. We must prove that the Sylow p-subgroup P is conjugate to K, 
that is, that P is one of the K;. To do so we use the relation of P-conjugacy. 
Since each K, is a conjugate of K, and conjugacy is transitive, every 
conjugate of K,in Gis also a conjugate of K;. In other words, every con- 
jugate of K, is some K, Consequently, the equivalence class of K, under 
P-conjugacy contains only various K, So the set S = {Kj, Kz, . - - Kj} 
of all conjugates of Kis a union of distinct equivalence classes under 
P-conjugacy. The number of subgroups in each of these equivalence 
classes is a power of p because by Theorem 9.25 the number of sub- 
groups that are P-conjugate to K, is [P: P N N(K;)], which is a divisor of 
|P| = p” by Lagrange’s Theorem. Therefore, t (the number of subgroups 
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in the set S) is the sum of various powers of p (each being the number of 
subgroups in one of the distinct equivalence classes whose union is S). 
Since p doesn’t divide t, at least one of these powers of p must be p’ = 1. 
Thus some K; is in an equivalence class by itself, meaning that 

x'K,x = Kfor every xe P. Lemma 9.26 (with Q = K) implies that 

x EK, for every such x, so that P S K, Since both P and K, are Sylow 
p-subgroups, they have the same order. Hence, P = K; @ 


Proof of the Third Sylow Theorem 9.17> Let s = {K,,..., K} be the set of all 
Sylow p-subgroups of G. By the Second Sylow Theorem, they are all the 
distinct conjugates of K,. The proof of the Second Sylow Theorem shows 
that t = [G: N (K,)], which divides the order of G by Lagrange’s Theorem. 

Let P be one of the K, and consider the relation of P-conjugacy. The 
only P-conjugate of P is P itself by closure. The proof of the Second Sylow 
Theorem shows that the only equivalence class consisting of a single sub- 
group is the class consisting of P itself. The proof also shows that S is the 
union of distinct equivalence classes and that the number of subgroups in 
each class is a power of p. Just one of these classes contains P, so the mum- 
ber of subgroups in each of the others is a positive power of p. Hence, the 
number t of Sylow p-subgroups is the sum of 1 and various positive powers 
of p and, therefore, can be written in the form 1 + kp for some integer k. E 


E Exercises 


NOTE: Unless stated otherwise, G is a finite group and p is a positive prime. 
A. 1. List the distinct conjugacy classes of the given group. 
(a) D, OS © 4 


2. If a E G, then show by example that C(a) may not be abelian. [Hint: If 
a = (12) in S, then (34) and (345) are in C{a).] 


3. If His a subgroup of Gand ac H, show by example that the conjugacy class 
of ain H may not be the same as the conjugacy class of ain G. 


4. Write out the part of the proof of Theorem 9.21 showing that fis injective, 
including the reasons for each step. Your answer should begin like this: 
f(Cy) =f(C x)= yay = xax [Definition of f] 
=a = yx axy !. [Left multiply by y and right multiply by y`] 
5. List all conjugates of the Sylow 3-subgroup ((123)) in Sy. 


6. If Hand K are subgroups of G and H is normal in K, prove that Kis a 
subgroup of N(#). In other words, N(H) is the largest subgroup of G in which 
His a normal subgroup. 


7. If Ais a subgroup of G, prove that 
(a) AS MA); 
(b) ge N(A) if and only if Ag = gA. 
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8. If N is a subgroup of Z(G), prove that N is a normal subgroup of G. 


B. 9. If Cis a conjugacy class in G and fis an automorphism of G, prove that f(C) is 
also a conjugacy class of G. 


10. Let G be an infinite group and H the subset of all elements of G that have only 
a finite number of distinct conjugates in G. Prove that H is a subgroup of G. 


11. If Gis a nilpotent group (see Exercise 13 of Section 9.3), prove that G has 
this property: If m divides |G|, then G has a subgroup of order m. [You may 
assume Exercise 22.] 


12. Let K be a Sylow p-subgroup of Gand N a normal subgroup of G. If Kisa 
normal subgroup of N, prove that K is normal in G. 


13. Prove Theorem 9.23. 

14. Let N be a normal subgroup of G, a € G, and C the conjugacy class of a in G. 
(a) Prove that ac N if and only if CEN. 
(b) If C,is any conjugacy class in G, prove that C,¢ Nor GN N = Ø. 


(c) Use the class equation to show that |N] = [Cil + - +- + ICd, where C;,.. 
C; are all the conjugacy classes of G that are contained in N. 


15. If N + (e) is a normal subgroup of G and |G| = p", prove that N N Z(G) + (e). 
[Hint: Exercise 14(c) may be helpful.] 


16. Complete the proof of Theorem 9.24. 
17. Prove Theorem 9.25. 


Sanit } 


18. If K is a Sylow p-subgroup of Gand H isa subgroup that contains N(K), 
prove that [G:H] = 1 (mod p). 


19. If K is a Sylow p-subgroup of G, prove that N(N(K)) = MK). 


20. If H is a proper subgroup of G, prove that G is not the union of all the 
conjugates of H. [Hint: Remember that H is a normal subgroup of M(H); 
Theorem 9.25 may be helpful.] 


21. If His a normal subgroup of Gand H is a subgroup of G with |H} = p*, 
prove that H is contained in every Sylow p-subgroup of G. [You may assume 
Exercise 24.] 


C. 22. If |G| = p", prove that G has a normal subgroup of order p""'. [Hint: You may 
assume Theorem 9.27 below. Use induction on”. Let N = (a) , where a E Z(G) 
has order p (Why is there such an a?); then G/N has a subgroup of order p75 
use Theorem 8.24.] 


23. If |G} = p", prove that every subgroup of G of order p"! is normal. 


24. If His asubgroup of Gand H has order some power of p, prove that 
H is contained in a Sylow p-subgroup of G. [Hint: Proceed as in the 
proofs of the Second and Third Sylow Theorems but use the relation of 
H-conjugacy instead of P-conjugacy on the set {K,,..., K,} of all Sylow 
p-subgroups.] 
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9.5 The Structure of Finite Groups 


The tools developed in Sections 9.1-9.4 are applied here to various aspects of the 
classification problem. In particular, all groups of orders $15 are classified. We begin 
with some useful facts about p-groups. 


Theorem 9,27 


If G is a group of order p”, with p prime and n = 1, then the center Z(G) 
contains more than one element. In particular, |Z(G)| = p* with 1 =k = n. 


Proof» By Lagrange’s Theorem, |Z(G)| = p* with 0 = k = n. We now show that 
k = 1, that is, that |Z(G) = p. Form (3) of the class equation (page 306) 
shows that 

|Z(G)| =|G| -Gil — |G) = -+ - IG 


where each |Cjis a number larger than 1 that divides |G} Since |G| = p", 
the divisors of |G] larger than 1 are positive powers of p. Therefore, each 
|Cis divisible by p. Since |G] is also divisible by p, it follows that p divides 
|Z(G) and, hence, |Z(G)|= p. m 


Corollary 9.28 


If p is a prime and n > 1, then there is no simple group of order p”. 


Proof» if Gis a group of order p", then Z(G) is a normal subgroup. If Z(G) + 
G, then Gis not simple. If Z(G) = G, then G is abelian and not simple by 
Theorem 8.25. E 


Corollary 9.29 


lf G is a group of order p°, with p prime, then G is abelian. Hence, G is 
isomorphic to Zp or Zp X Zp 


EXAMPLE 1 


By Corollary 9.29, every group of order 9 is isomorphic to Zy or Z; X Z3. 
Similarly, the only groups of order 169 = 13? (up to isomorphism) are Z;s and 
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Proof of Corollary 9.29> Z(G) has order p or p? by Lagrange’s Theorem and 
Theorem 9.27. If Z(G) has order p°, then G = Z(G), which means that 
Gis abelian. If Z(G) has order p, then the quotient group G/Z(G) has 
order |G|/|Z(G)| = p?/p = p by Theorem 8.13. Hence, G/Z(G) is cyclic by 
Theorem 8.7. Therefore, Gis abelian by Theorem 8.15. The last state- 
ment of the theorem now follows immediately from the Fundamental 
Theorem of Finite Abelian Groups. W 


In Corollary 9.18 certain groups of order pq (with p, q prime) were characterized. 
We can now extend that argument to some groups of order Pq: 


Theorem 9.30 


Let p and q be distinct primes such that q # 1 (mod p) and p? # 1 (mad q). If G 
is a group of order pq, then G is isomorphic to Z», or Z, X Zp X Zo 


EXAMPLE 2 


Theorem 9.30 allows us to classify all groups of order 45. Note that 45 = 3? 5, 
and that 5 ¥ 1 (mod 3) and 3? # 1 (mod 5). So if Gis a group of order 45, 
then by Theorem 9.30 (with p = 3 and q = 5), G is isomorphic to Z45 or to 
Z,X Z, X Z,. Similar arguments may be used to classify groups of many differ- 
ent orders, including 


9 =9: 11, 153 = 9: 17, 175 = 25-7, 245 = 49 + 5, 
325 = 25 - 13, 539 = 49 + 11. 


Proof of Theorem 9.30 > By the Third Sylow Theorem, the number of Sylow 
p-subgroups of G is congruent to 1 modulo p and divides |G}. Since the 
divisors of |Glare 1, p, p’, q, pg, and pq, the only possibilities are 1 and 
q. There cannot be q of them because q # 1 (mod p). Hence, there is a 
unique Sylow p-subgroup H, which is normal by Corollary 9.16. Similarly, 
G has 1, p, or p° Sylow g-subgroups, and neither p nor p° is possible since 
pP # 1 (mod q). Hence, there is a unique normal Sylow q-subgroup K. 
The order of the subgroup H N K must divide both |H| = p? and |K] = q by 
Lagrange’s Theorem. Hence, H N K = (e). Furthermore, HK = G 
by Exercise 15 in Section 9.3. Therefore, G = H X K by Theorem 9.3. 
Now His isomorphic to Z, or Z, X Z, by Corollary 9.29 and K = Z, 
by Theorem 8.7. Consequently, by Lemma 9.8, G = H X K = 
Zpx@,=Z,0G=HXK=2Z,x2Z,xz, B 


Corollary 9.31 
If p and q are distinct primes, then there is no simple group of order p’q. 
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Proof» Suppose G is a group of order p’g. If either p* # 1 (mod q) or q # 1 
(mod p), then the proof of Theorem 9.30 shows that G has a normal 
Sylow subgroup and, hence, is not simple. If both p? = 1 (mod q) and 
q = 1 (mod p), then q |(p? — 1) and p | (q — 1), which implies that ps 
q — 1 or, equivalently, q = p + 1. Since p? — 1 = (p — 1)(p + 1), we 
know that q | (p — 1) or g|(p + 1) by Theorem 1.5. The former is impos- 
sible because q = p + 1, and the latter implies that q = p + 1, so that 
q =p + 1. Since p and q are primes, the only possibility is p = 2 and 
q = 3. Exercise 2 shows that no group of order 27-3 = 12 is simple. m 


Dihedral Groups 


We now introduce a family of groups that play a crucial role in the classification of 
groups of order 2p. Recall that the group D, consists of various rotations and reflections 
of the square (see Section 7.1 or 7.1.A). This idea can be generalized as follows. Let P 
be a regular polygon of n sides (n = 3).* For convenient reference, assume that P has its 
center at the origin and a vertex on the negative x-axis, with the other vertices numbered 
counterclockwise from this one, as illustrated here in the cases n = 5 and n = 6. 


Think of the plane as a thin sheet of hard plastic. Cut out P, pick it up, and replace it, not 
necessarily in the same position, but so that it fits exactly in the cut-out space. Such a motion 
is called a symmetry of P.t By considering a symmetry as a function from P to itself and 
using composition of functions as the operation (gf means motion f followed by motion g), 
the set D,, of all symmetries of P forms a group, called the dihedral group of degree z. 


Theorem 9,32 


The dihedral group D, is a group of order 2n generated by elements r and d 
such that 


rl=na,  |el=2, and = dr=rd, 


Proof» The proof that D, is a group is left to the reader. Let r be the counter- 
clockwise rotation of 360/7 degrees about the center of P; r sends 
vertex 1 to vertex 2, vertex 2 to vertex 3, and so on. Note that r has 


* “Regular” means thatall sides of P havethe same length and all its vertex angles (each formed by 
two adjacent sides) are the samesize. It can be shown that the perpendicular bisec tors of the n sides 
all intersect at a single point, which is called the center of P. 


tAII motions that result in the same final position for P are considered to be the same. 
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order n because r” is a 360° rotation that returns P to its initial position 
(the identity symmetry). Let d be the reflection in the x-axis, As shown in 
the following figure, d “reverses the orientation” of P: vertices that were 
formerly numbered counterclockwise from vertex | are now numbered 
clockwise: 


The element d has order 2 because reflecting twice in the x-axis also 
returns P to its initial position. 

Since adjacent vertices of P remain adjacent under any symmetry, 
the final position of P is completely determined by two factors: the 
new orientation of P (whether the vertices are numbered clockwise 
or counterclockwise from vertex 1) and the new location of vertex 1. 
Consequently, every symmetry is the same as either 


r= (0Si<n) [Counterclockwise rotation of i(360/n) 
degrees that preserves orientation and moves 
vertex l to the position originally occupied by 
vertex i+ I] 


or 
rd (0sSi<n) [Reflection in the x-axis that reverses 
orientation followed by a counterclockwise 
rotation that moves vertex 1 to the position 
originally occupied by vertex i + I] 
Therefore 


D, = {e = P,r, P, ..., MEP TTS, . . . OM. 


Furthermore, the 2n elements listed here are all distinct (* and * move 

vertex 1 to different positions and z‘ = r/d is impossible since ? preserves 

the vertex orientation, but r/d reverses it). Hence, D, is a group of order 2n. 
Finally, verify that drd moves vertex 1 to the position originally 

occupied by vertex n and leaves the vertices in counterclockwise order. 

In other words, drd is the rotation that moves vertex 1 to vertex n, that 

is, drd = P|. Since r has order n, r~! = 7"! and, hence, drd = r. 

Multiplying on the right by d shows that dr =r—1d. W 


We can now classify another family of groups. 


Theorem 9,33 


If G is a group of order 2p, where p is an odd prime, then G is isomorphic to 
the cyclic group Z,, or the dihedral group Dp. 
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EXAMPLE 3 


Theorem 9.33 can be used to classify all groups of orders 6, 10, 14, 22, 26, 34, 
etc. For instance, every group of order 22 is isomorphic either to Zy or Dy), 
and every group of order 38 is isomorphic either to Z, or Dig. Theorem 9.33 
also provides a second proof that there are exactly two nonisomorphic groups 
of order 6. (See Theorem 8.9 for the first proof.) 


Proof of Theorem 9.38» G contains an element a of order pand an element b of 
order 2 by Cauchy’s Theorem (Corollary 9.14). Note that b? = e implies 
b57! = b. Let H be the cyclic group (a). Since |G| = 2p, the subgroup 
H has index 2 and is, therefore, normal by Exercise 23 of Section 8.2. 
Consequently, bab = bab~' € H. Since H is cyclic, bab = a! for some t. 
Using this and the fact that 5? = e, we see that 


j 


a = (a‘)' = (bab)! = (bab)(bab)(bab) - - - (bab) = ba'b = b(bab)b = a 


Hence, f?.= 1 (mod p) by part (2) of Theorem 7.9. Consequently, 
p divides ? — 1 = (t — 1)(¢ + 1), which implies that p|(t — 1) or p|(¢ + 1) 
by Theorem 1.5. Thus ¢ = 1 (mod p) or t = —1 (mod p). 

If t = 1 (mod p), then bab = a’ = a by Theorem 7.9. Multiplying 
both sides by b shows that ba = ab. It follows that ab has order 2p = |G| 
(Exercise 33 of Section 7.2). Therefore, G is cyclic and isomorphic to Z,, 
by Theorem 7.19. 

If t = —1 (mod p), then bab = a~'. Exercise 9 shows that the map 
fD, — G given by f (rd) = db’ is a homomorphism. Let K be the 
subgroup (b). Since |H| = p (with p odd) and|K| = 2, H N K = (e) by 
Lagrange’s Theorem and G = HK by Exercise 15 in Section 9.3. Thus 
every element of G can be written in the form a‘b/, which implies that f 
is surjective. Since D, and G have the same order, f must be injective and, 
hence, an isomorphism. W 


Groups of Small Order 


We are now in a position to complete the classification of groups of small order that 
was begun in Section 8.1, where groups of orders = 7 were classified. We already 
know three abelian groups of order 8 (Z, X Z, X Z3, Z4 X Z,, and Z,) and one nona- 
belian one (D,). Another nonabelian group of order 8, the quaternion group Q, was 
introduced in Exercise 16 of Section 7.1. It is not isomorphic to D, by Exercise 47 of 
Section 7.4. These five groups are the only ones: 


Theorem 9,34 


If G is a group of order 8, then G is isomorphic to one of the following groups: 
Za Z4 X Lo, Z X Ly X Lo, the dihedral group D4 or the quaternion group Q. 
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Proof» if Gis abelian, then G is isomorphic to Zg, Z4 X Z}, or Z, X Z, X Z, by the 
Fundamental Theorem of Finite Abelian Groups. So suppose G is a nona- 
belian group of order 8. The nonidentity elements of G must have order 2, 
4, or 8 by Lagrange’s Theorem. However, G cannot contain an element of 
order 8 (because then G would be cyclic and abelian), nor can all the non- 
identity elements of G have order 2 (see Exercise 27 of Section 7.2). Hence, 
Gcontains an element a of order 4. Let b be any element of Gsuch that 
b & (a) = {e, a, a’, a}. Then the eight elements e, a, a’, a’, b, ab, ab, a*b 
are all distinct because jaj = 4 and a’ = a/b implies b = a4 € (a), contrary 
to the choice of b. Thus G = {e, a, a’, a’, b, ab, a’b, a’b}. 

The subgroup (a) has order 4 and index 2 in G. Hence, (a) is normal by 
Exercise 23 of Section 8.2. Now the element bab“ has order 4 by Exercise 19 
of Section 7.2 and bab™ € (a) by normality. Therefore, bab“ is either a or a” 
(because e has order 1 and a” has order 2). If bab™! = a, however, then 
ba = ab, which implies that Gis abelian. Therefore, bab! = a’ = a™ so that 
ba = a“'b. This fact can be used to construct most of the multiplication table 
of G. For instance, (ab)a? = a(ba)a = afa~'b)a = ba = a'b = œb. You can 
use similar arguments to verify that the table must look like this: 


a b ab ah’ 


e a a 
e e a a @ 
a a a a e ab @b a&b b 
2 a a e a ab ab b ab 
P a e a a ab b ab ab 
b b ab a@b ab 
ab ab b ab a’b 


In order to complete the table, we must find b°. Since b? = a'b implies b 
= a'e (a), which is a contradiction, b? must be one of e, a, a”, or a’. If b? 
= a, however, then ab = b?b = bb? = ba, which implies that G is abelian. 
Similarly, & = a’ implies that G is abelian (Exercise 15). Therefore, b? = 
e or b = a’. Each of these possibilities leads to a different table for G. 
Completing the table when ¿+? = e and comparing it to the table for D4 in 
Example 1 of Section 8.2 shows that G = D, under the correspondence 


a—>r, b—d, ab—h, a&b—t abv 


(Exercise 4). Similarly, completing the table when 5? = a” and comparing it 
to the table for the quaternion group Q shows that G = Q (Exercise 5). E 


According to the Fundamental Theorem of Finite Abelian Groups there are two 
abelian groups of order 12: Z, X Z, = Zy and Z, X Z, X Z3. We have also seen two 
nonabelian groups of order 12: the alternating group A, and the dihedral group Dg. It 
can be shown that there is a third nonabelian group T of order 12, which is generated 
by elements a and b such that |a| = 6, & = a’, and ba = ad and that no two of these 
three nonabelian groups are isomorphic (Exercise 16). 
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Theorem 9,35 


If G is a group of order 12, then G is isomorphic to one of the following groups: 
Zin Z X Z; X Z, the alternating group A4, the dihedral group Dg, or the 
group T described in the preceding paragraph. 


Proof» An argument similar to the proof of Theorem 9.34 can be used to prove 


the theorem. See Theorem II.6.4 in Hungerford [5]. E 


The preceding results provide a complete classification of all groups of orders = 15, 
that is, a list of groups such that every group of order = 15 is isomorphic to exactly one 


group on the list. 
ORDER GROUPS REFERENCE 
2 Z Theorem 8.7 
3 Z Theorem 8.7 
4 Zp ZX Z Theorem 8.8 
5 Zs Theorem 8.7 
6 Zo $3 Theorem 8.9 
7 2, Theorem 8.7 
8 Zs, Z4 X Za, Z3 X Za X Ly, Da, Q Theorem 9.34 
9 Zo, Z3 X £3 Corollary 9.29 
10 Zio, Ds Theorem 9.33 
11 Zn Theorem 8.7 
12 Zin, Z2 X Z X 23, Ag, De, T Theorem 9.35 
13 Zi Theorem 8.7 
14 Zia Dy Theorem 9.33 
15 Zis Corollary 9.18 


This list could be continued to order 100 and beyond. For more than half of the 
orders between 2 and 100, the techniques presented above provide a complete clas- 
sification of groups of that order (Exercise 6). For other orders, however, a great deal 
of additional work would be necessary. For instance, there are 14 different groups of 
order 16 and 267 of order 64. There is no known formula giving the number of distinct 
groups of order n. 


E Exercises 


A. 1. If pand qare primes with p < q and q # 1 (mod p) and Gis a group of order 
P’@, prove that Gis abelian. 


2. Prove that there is no simple group of order 12. [Hinz: Show that one of the 
Sylow subgroups must be normal.] 


3. Prove that D3 is isomorphic to 5g. 
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4. (a) In the proof of Theorem 9.34, complete the operation table for the group 
Gin the case when 5? = e. 


(b) Show that G = D, under the correspondence 
a—>r, b—d, ab—>h, ab—t, ab—u 


by comparing the table in part (a) with the table for D, in Example 1 of 
Section 8.2. 


5. (a) In the proof of Theorem 9.34, complete the operation table for the group 


Gin the case when b? = a’. 


(b) Show that G= Q under the correspondence 
ab’ — i'}* (Osrs3,0ss51) 


by comparing the table in part (a) with the table for Q (see Exercise 16 in 
Section 7.1). 


6. Theorems 8.7, 9.7, 9.30, and 9.33, and Corollaries 9.18 and 9.29 are sufficient 
to classify groups of many orders. List all such orders from 16 to 100. 


B. 7. If Gis a group such that every one of its Sylow subgroups (for every prime p) is 
cyclic and normal, prove that Gis a cyclic group. 


8. Let n = 3 bea positive integer and let G be the set of all matrices of the forms 


l a = i 
G <i or + a with a E Z,. 


(a) Prove that G is a group of order 2n under matrix multiplication. 
(b) Prove that G is isomorphic to D,- 


9. Complete the proof of Theorem 9.33 by showing that when bab = a™', the 
map fD, — G given by f(r'd") = ab! is a homomorphism. [Hint: bab = a7 is 
equivalent to ba = a~'b. Use this fact and Theorem 9.32 to compute products 
in Gand Dp] 


10. Prove that the dihedral group De is isomorphic to $3 X Z}. 
11. (a) If n = 2k, show that r* is in the center of D,. 

(b) If 2 is even, show that Z(D,) = {e, r*}. 

(c) If xis odd, show that Z(D,) = {e}. 


12. In Theorem 9.32, r is used to denote a rotation. To avoid confusion here, r will 
denote the 60° rotation in D, and F will denote the 120° rotation in D,. The 
proof of Theorem 9.32 shows that the elements of De can be written in the 
form r‘d/, and the elements of D, in the form rid’. 


(a) Show that the function ¢:D, —> D; given by ¢(r'd!) = r'dJis a surjective 
homomorphism, with kernel {r°, r°}. 


(b) Prove that D,/Z(Dg) is isomorphic to D}. [Hint: Exercise 11.] 
13. What is the center of the quaternion group Q? 
14. Show that every subgroup of the quaternion group Q is normal. 


Copyrigts 2012 Cangnge Loanne AR Right Caamved May not ba copied, esmai. or dupficmat, in whose or is parl Dep to elactronic dghtp, ama thicd pasty contest May be mppmaad fen the eBoct emir eChagear(s). Edimwial eeview has 
Gnemmed that acy appn atan dose oct eumadaNy dfc the overt Learning arpea Cangege Learning maacves the right 0) amva efidixtonal cuntent af aay tne if sthenquem rights cestrictines require it. 


320 Chapter9 Topics in Group Theory 


15. If Gis a group of order 8 generated by elements a and b such that |a| = 4, 
b ¢ (a), and b? = a’, then Gis abelian. [This fact is used in the proof of 
Theorem 9.34, so don’t use Theorem 9.34 to prove it.] 


16. Let G be the group S, X Z, and let a = ((123), 2) and b = ((12), 1). 
(a) Showthat j| = 6, b? = a’,and ba = a8. 
(b) Verify that the set T = {e = a’, a’, a’, a’, a’, a’, b, ab, a*b, a°b, atb, a°b} 
consists of 12 distinct elements. 


(c) Show that T is a nonabelian subgroup of G. [Hint: Use part (a) and 
Theorem 7.12.] 


(d) Show that T is not isomorphic to D, or to Ay. 


17. Let n be a composite positive integer and p a prime that divides n. Assume 
that 1 is the only divisor of n that is congruent to 1 modulo p. If Gis a group 
of order n, prove that G is not simple. 


18. If Gis a simple group that has a subgroup K of index n, prove that |G| 
divides n!. [Hint: Let T be the set of distinct right cosets of K and consider 
the homomorphism 9:G— A(T) of Exercise 41 in Section 8.4. Show that gis 
injective and note that 4(T) = S, (Why?).] 


C. 19. Classify all groups of order 21 up to isomorphism. 
20. Classify all groups of order 66 up to isomorphism. 


21. Prove that there is no simple nonabelian group of order less than 60. 
[Hint: Exercise 18 may be helpful.] 
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CHAPTER [0 


Arithmetic in Integral Domains 


In Chapters 1 and 4 we saw that the ring Z of integers and the ring F[x] of polynomi- 
als over a field F have very similar structures: both have division algorithms, great- 
est common divisors, and unique factorization into primes (irreducibles). In this 
chapter we find conditions under which these properties carry over to arbitrary 
integral domains, with particular emphasis on unique factorization. 

Unique factorization turns out to be closely related to the ideals of a domain. On 
the one hand, unique factorization is not possible unless the principal ideals of the 
domain satisfy certain conditions (Section 10.2). On the other hand, ideals can be 
used to restore a kind of unique factorization to some domains that lack it. Indeed, 
ideals were originally invented just for this purpose, as we shall see in Section 10.3. 

Section 10.4 (The Field of Quotients of an Integral Domain) is independent of 
the rest of the chapter and may be read at any point after Chapter 3. Sections 10.2 
and 10.3 depend on Chapter 6, but the rest of the chapter may be read after 
Chapter 4. 

The interdependence of the sections of this chapter is shown below. The 
dashed arrows indicate that Sections 10.2, 10.3, and 10.5 depend only on the first 
part of Section 10.1 (pages 322-324) and that Section 10.5 uses only three results 
in Section 10.2, all of which can be read independently of the rest of that section. 


itz... 


A shortened version of Sections 10.1 and 10.2 that contains all the basic informa- 
tion may be obtained by omitting the last parts of each of these sections (see the 
notes on pages 325 and 337). 


321 
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10.1 | Euclidean Domains 


In early chapters we analyzed the structure of Z and the polynomial ring F[x] by using 
divisibility, units, associates, and primes (irreducibles). We begin by defining these con- 
cepts in the more general setting of an integral domain.* 


Throughout this chapter, R is an integral domain. 


Let a, bE R, with a nonzero. We say that a divides h (or a is a factor of b) and write 
a | b if b = ae for some cE R. Recall that an element u in R is a unit provided that 
uv = lp for some vE R. Thus the units in R are precisely the divisors of lp. 


EXAMPLE 1 


The only units in Z are ] and —1. If F is a field, then the units in the polyno- 
mial ring F [x] are the nonzero constant polynomials (Corollary 4.5). 


EXAMPLE 2 


The set Z[V2] = {r + sV/2|r, sE Z} is a subring of the real numbers (Exercise 1). 
The element 1 + V2 isa unit in Z[V2] because 


(1 + V2(-1+ V2) =1. 


The ring in the preceding example is one of many similar rings that will frequently be 
used as examples later. If dis a fixed integer, then it is easy to verify that the set Z[Vd] 
= {r + sVd |r, s EZ} is an integral domain that is contained in the complex numbers. If 
d= 0, then Z[VV/d] is a subring of the real numbers (Exercise 1), When d = —1, then the 
ring Z[V—1] is usually denoted Z[/ and is called the ring of Gaussian integers. 


Remark Let we R be a unit with inverse v, so that uv = lg. For any b E R we 
have u(vb) = (uv)b = lgb = b. Therefore, 


a unit divides every element of R 


An element a E R is an associate of bE R provided a = bu for some unit u. Now, u 
has an inverse, say uv = lp, and v is also a unit. Multiplying both sides of a = bu by v 
shows that av = buv = blg = b. Use these facts to verify that 


a is an associate of 4 if and only if b is an associate of a 


and 


a nonzero element of R is divisible by each of its associates. 


*The basic definitions apply in any commutative ring with identity. We restrict our attention to 
integral domains because most of the theorems fail in nondomains. 
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EXAMPLE 3 


Every nonzero integer n has exactly two associates in Z, n and —a. If Fis a 
field, the associates of f(x) € F[x] are the nonzero constant multiples of f(x). 
In the ring Z[V2], the elements V2 and 2 — V2 are associates because 
V2 = (2 — V2\1 + V2) and 1 + V2isa unit by Example 2. 


A nonzero element p E R is said to be irreducible provided that p is not a unit and 
the only divisors of p are its associates and the units of R. 


EXAMPLE 4 


The irreducible elements in Z are just the prime integers because the only divi- 
sors of a prime pare +p (its associates) and +1 (the units in Z). The definition 
of irreducible given above is identical to the definition of an irreducible polyno- 
mial in the integral domain F[x], when Fis a field (see Section 4.3). In Section 10.3 
we shall see that 1 + iis irreducible in the ring Z[i]. 


The next theorem is usually the easiest way to prove that an element is irreducible 
and is sometimes used as a definition. Theorem 4.12 is the special case when R = F[x]. 


Theorem 10,1 


Letp beanonzero,nonunitelementin an integral domain R. Thenp is irreducible 
if and only if 


whenever p = rs, then r or s is a unit. 


Proof» 1f p is irreducible and p = rs, then r is a divisor of p. So r must be either 
a unit or an associate of p. If r is a unit, there is nothing to prove. If r is 
an associate of p, say r = pv, then p = rs = pus. Canceling p on the two 
ends (Theorem 3.7) shows that lz = vs. Therefore, s is a unit. 

To prove the converse, suppose p has the stated property. Let c be any 
divisor of p, say p = cd. Then by hypothesis either c or dis a unit. If d 
is a unit, then so is d~’. Multiplying both sides of p = ed by d ' shows 
that c = d'p. Thus in every case c is either a unit or an associate of p. 
Therefore, p is irreducible. W 


Euclidean Domains 


The Division Algorithm was a key tool in analyzing the arithmetic of both Z and 
F[x]. So we now look at domains that have some kind of analogue of the Division 
Algorithm. To see how to describe such an analogue, note that the degree of a poly- 
nomial in F[x] can be thought of as defining a function from the nonzero polynomials 
in F[x] to the nonnegative integers. By identifying the ley properties of this function 
we obtain this 


(Copyright 2012 Caagagp Lemming. All Qight Camrve May act be copied, armen, or dopticannd, in whale or in part Dus to elacaranic cights, ama third party contam mey be mppmamad frun the eBook zatte o hapanrfa). Edimrial seview has 
Gened that any armad coated! does oct extedely affect the ovni larning asperieare Campig? Lemming Marve tes right to eave efidicionel Coutnat mt asy time if suhampu sights comrictims ragaie it. 


324 Chapter 10 Arithmetic in Integral Domains 


Definition An integral domain R is a Euclidean domain if there is a function ô from 
the nonzero elements of Rto the nonnegative integers with these properties; 


(i) if a and b are nonzero elements of R, then (a) = d{ab). 


(iiy a, b ER and b # Op, then there exist g, r ER such that a = bg +r 
and either r = O, or 8(r) <.8(b). 


EXAMPLE 5 


If Fis a field, then the polynomial domain F[x] is a Euclidean domain with 
the function ô given by 5(f(x)) = degree of f(x). Property (i) follows from 
Theorem 4.2 because 


5(f(x)g()) = deg f(x)g(x) = deg f(x) + deg g(x) 
= deg f(x) = (/@)), 


and property (ii) is just the Division Algorithm (Theorem 4.6). 


EXAMPLE 6 


Z is a Euclidean domain with the function 6 given by 8(a) = |a|. Property (i) 
holds because |ab| = |a||b| = |a| for all nonzero a and b. If a, b EZ, with b > 0, 
then by the Division Algorithm (Theorem 1.1) there are integers q and r such that 
a= bq +rand0 =r <b. Either r = 0, or r and b are both positive, in which 
case, 6(r) = |r| = + < b = |b| = (b). Therefore, property (ii) holds when b > 0. 
For the case when b < 0, see Exercise 9. 


EXAMPLE 7 


We shall prove that the ring of Gaussian integers Z[i] = {s + ti | s,teZ}isa 
Euclidean domain with the function ô given by (s + ti) = s? + 22. Sinces + ti = Oif 
and only if both s and ¢ are 0, we see that 5(s + ti) = 1 when s + ti + 0. Verify 
that for any a = s + ti and b = u + vi in Z[i], 5(ab) = 5(a) 5(6) (Exercise 17). 
Then when b # 0 we have 


8(a) = 8(a) « 1 = 8(a)5(b) = 8(ab), 


so that property (i) holds. If b # 0, verify that a/b is a complex number that can 
be written in the form c + di, where c, de Q (Exercise 11). Since c € Q, it lies 
between two consecutive integers; and similarly for d. Hence, there are integers 
mand n such that |m — e| = 1/2 and |n — d| = 1/2. Since a/b = c + di, 
a = ble + di] = b[(c -m + m) + (d-n + ni] 

= b|(m + ni) + (ce — m) + (d - ni] 

= b[m + ni] + b[(c — m) + (d — n)i] 

=bq+r, 
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where q = m + nie Z[i] and r = b[(c — m) + (d — n)i]. Since r = a — bq anda, 
b, gE Z[i], we see that r € Z[i]. Property (ii) holds because 
S(r) = 8(5)8[(c — m) + (d — n)i] = 5(B)[(c — m? + (d — ny] 
= 5(b)(1I/2)° + (1/2)"] = (1/2) - 8) < (6). 


NOTE: The remainder of this section is optional. The development here is 
elementary and assumes only the basic facts about rings in Section 3.1. A 
more sophisticated approach is presented in Section 10.2, where ideals are 
used to develop the key facts about a wider class of domains that includes 
Euclidean domains as a special case. Thus this section develops some re- 
markably strong results with a minimum of mathematical tools, whereas 
Section 10.2 obtains the same results more efficiently in a wider setting. 


It is possible that a given integral domain may be made into a Euclidean domain 
in more than one way by defining the function 5 differently (see Exercises 12 and 13). 
Whenever the Euclidean domains in the preceding examples are mentioned, however, 
you may assume that the function 6 is the one defined above. 

In F[x], the units are the polynomials of degree 0 (Corollary 4.5), that is, the poly- 
nomials that have the same degree as the identity polynomial 1; Furthermore, if k is 
a constant (unit in F[x]), then f(x) and k/(x) have the same degree. Analogous facts 
hold in any Euclidean domain. 


Theorem 10,2 


Let R be a Euclidean domain and u a nonzero element of R. Then the following 
conditions are equivalent: 


(1) u isa unit. 
(2) 5(u) = (1p). 
(3) 8(c) = S(uc) for some nonzero cER. 


Proof» (1) => (2) Exercise 15. 
(2) = (3) Statement (3) holds with c = 1p because 5(1p) = 5(u) = ô(u * 1p). 


(3) = (1) According to (ii) in the definition of a Euclidean domain (with c 
and uc in place of æ and b), there exist q, r E R such that 


c= (ucq +r andeither r= 0k or 5(r) < 8(uc). 
If ô(c) < S(uc), then by part (i) of the definition (with c and lp — uq in 
place of a and b) and statement (3), 
5(c) 5 S(p — u4)) = 8(c — ueg) = 8ft) < 5(uc) = (0), 


so that 5(c) < ê(c), a contradiction. Hence, we must have r = 0g. Thus 
c = (uc)q, which implies that lg = ug. Therefore, u isa unit. W 
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In the remainder of this section we shall develop the basic facts about greatest com- 
mon divisors, irreducibles, and unique factorization in Euclidean domains, The devel- 
opment here parallels the ones given in Chapter 1 for Z and in Chapter 4 for F[x] and 
most of the arguments are the same ones used there, with appropriate modifications. 
Alternatively, the major results in Sections 1.2-1.3 and 4.2-4.3 may be considered as 
special cases of the theorems proved here. 


Greatest Common Divisors 


The integers are ordered by = and polynomials in F[x] are partially ordered by their 
degrees. This made it natural to define greatest common divisors in these domains in 
terms of size or degree. The same idea carries over to Euclidean domains, where “size” 
is measured by the function ô. 


Definition Let R be a Euclidean domain and a, bER (not both zero). A greatest 


common divisor of a and b is an element d such that 


i) d | aandd | b; 
(ii) ifc | aande | b, then &(c) = &(d). 


Any two elements of a Euclidean domain R have at least one common divisor, 
namely lp. If c|a, say a = ct, then 5(c) S ê(ct) = ê(a). Consequently, every common 
divisor c of a and b satisfies 5(c) Ss max {5(a), 5(b)}, which implies that there is a 
common divisor of largest possible ô value. In other words, greatest common divisors 
always exist. 

When gcd’s were defined in Z and F[x], an extra condition was included in each 
case: The gcd of two integers is the positive common divisor of largest absolute value 
and the gcd of two polynomials is the monic common divisor of highest degree. These 
extra conditions guarantee that greatest common divisors in Z and F[x] are unique. 
In arbitrary Euclidean domains there are no such extra conditions and greatest com- 
mon divisors are not unique. Thus the preceding definition is consistent with, but not 
identical to, what was done in Z and F[x]. 


EXAMPLE 8 


Z is a Euclidean domain with 5(@) = |a|. Under the preceding definition, 2 is 
the gcd of 10 and 18 just as before. However, —2 also satisfies this definition 
because —2 divides both 10 and 18 and any common divisor of 10 and 18 has 
absolute value = |—2|. Note that the greatest common divisors 2 and —2 are 
associates in Z. 
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Theorem 10.3 


Let R be a Euclidean domain and a, b ER (not both zero), 


(1) Ifd is a greatest common divisor of a and b, then every associate of 
d is also a greatest common divisor of a and b. 
(2) Any two greatest common divisors of a and b are associates. 


(3) If d is a greatest common divisor of a and b, then there exit u, vER 
such that d = au + by. 


Proof» (1) Exercise 16. 


We now find a particular greatest common divisor of a and b that will 
then be used to prove statements (2) and (3). Let 


S = {85(w) | 0g # we Rand w = as + bt for some s, t E R}. 


Since at least one of a = alg + bOgand b = apg + blg is nonzero by 
hypothesis, S is a nonempty set of nonnegative integers. By the Well- 
Ordering Axiom, S contains a smallest element, that is, there are 
elements d*, u*, v* of R such that d* = au* + bv* and 


(A) for every nonzero w of the form as + bt (with s, tE R), (dt) = 5(w). 


We claim that ¢* is a greatest common divisor of a and b. To prove 
this we first show that d* | a. By the definition of Euclidean domain, 
there are elements q, r such that a = d*q + randeither r = Og or 
5(r) < 6(a*). Note that 

r =a — d*q = a — (au* + bu*)q 
= a — agu* — bu*g = a(lę — qu*) + b(~v*ą). 
Thus r is a linear combination of a and b, and, hence, we cannot have 
5(r) < 5(d*) by (A). Therefore, r = Op, so that a = d*g and d* | a. A similar 
argument shows that d* | b and, hence, d* is a common divisor of a and b. 


Let c be any other common divisor of a and b. Then a = cs and b = ct 
for some s, tE R and hence 


(B) dt = aut + but = (cs)u* + (ctu* = e(su* + ta*). 


Thus by part (i) of the definition of Euclidean domain 6(c) = 
5(c(su* + tu*)) = 5(d*). Therefore, d* is a greatest common divisor of 
a and b. Note that (B) also shows that 


(C) every common divisor c of a and b divides d*. 


This completes the preliminaries. We now prove the rest of the theorem. 
(2) Let d be any greatest common divisor of a and b. Since d divides 
botha and b and d* is a greatest common divisor, we must have 5(d) = 5(d*) 
by part (ii) of the definition. The same definition with the roles of d and 
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d* reversed shows that 5(d*) = 5(d). Hence, 6(d) = 5(d*). By (C) we 
know that d | d*, say d* = dk. Therefore, 5(d) = 6(d*) = 5(dk). Hence, k 
is a unit by Theorem 10.2 and dis an associate of d*. Since every gcd is 
an associate of d*, any two of them must be associates of each other by 
Exercise 6. 

(3) If d is a greatest common divisor of a and b, then as we saw in the 
previous paragraph d* = dk, with k a unit. Since d* = au* + by*, we have 


d= dk = (au* + but)! = afutk-') + Blok”. 
Hence, d = au + bv, with u = utk andy = otk m 


Corollary 10.4 


Let R be a Euclidean domain and a, b ER (not both zero). Then d is a greatest 
common divisor of a and b if and only if d satisfies these conditions: 


(i) d | aandd | b; 
(ii) ifc |aandc|b,thenc | d. 


Proof»rf disa greatest common divisor of a and b, then d satisfies (i) by defini- 
tion. Suppose ¢ is a common divisor of a and b. Let d* be as in (+++) in 
the proof of Theorem 10.3. Then ¢| d*, say d* = ct. Furthermore, ¢ is 
an associate of d by Theorem 10.3 so that d* = dk, with k a unit. Hence, 
d = d*k = (cik! = c(tk), so that c | d. Therefore, condition (ii) holds. 
The proof of the converse is Exercise 18. W 


The Euclidean Algorithm (Exercise 15 of Section 1.2) provides the most efficient 
way of calculating the greatest common divisor of two integers. With minor modifica- 
tion its proof carries over to Euclidean domains and provides a constructive method 
of finding both greatest common divisors and the coefficients needed to write the ged 
of aand b as a linear combination of a and b. See Exercise 31. 


Unique Factorization 


Elements a and b of a Euclidean domain are said to be relatively prime if one of their 
greatest common divisors is 1p. In any domain the units are the associates of 1p. Thus 
by Theorem 10.3, a and b are relatively prime if and only if one of their greatest 
common divisors is a unit. 


Theorem 10.5 


Let R be a Euclidean domain and a, b, c ER. If a | bc and a and b are relatively 
prime, then a | c. 


Proof» Copy the proof of Theorem 1.4, using Theorem 10.3 in place of 
Theorem 1.2. E 
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Corollary 10.6 


Let p be an irreducible element in a Euclidean domain R. 


(1) ifp | bc, thenp | borp |c. 
(2) If p | a,@ +++ a,, then p divides at least one of the a,. 


Proof» (1) Let d be a greatest common divisor of p and b. Since d divides p, we 
know that d is either an associate of p or a unit. If dis an associate of 
p, then p is also a greatest common divisor of p and b by Theorem 10.3; 
in particular, p |b. If dis a unit, then p and b are relatively prime and, 
hence, p | c by Theorem 10.5. 


(2) Copy the proof of Corollary 1.6, using (1) in place of 
Theorem 1.5. E 


Theorem 10.7 


Let R be a Euclidean domain. Every nonzero, nonunit element of R is the prod- 
uct of irreducible elements,* and this factorization is unique up to associates; 
that is, if 


PsP2'**Pr = N92°*' Ws 


with each p, and q irreducible, then r = s and, after reordering and relabel- 
ing if necessary, 


piis an associate of q, for /=1,2,...,F. 


Proof» Let S be the set of all nonzero nonunit elements of R that are not the 
product of irreducibles. We shall show that S is empty, which proves that 
every nonzero nonunit element has at least one factorization as a prod- 
uct of irreducibles. Suppose, on the contrary, that S is nonempty. Then 
the set {5(s) | s €S} is a nonempty set of nonnegative integers, which 
contains a smallest element by the Well-Ordering Axiom. That is, there 
exists a E S such that 


(*) ô(a) = ô(s)  forevery se&S. 


Since a E S, a is not itself irreducible. By the definition of irreducibility, 

a = be with both b and c nonunits. Now 6(5) = 6(&c) by the definition of 
Euclidean domain. If 5(4) = 6 (6c), then b would be a unit by Theorem 10.2, 
which is a contradiction. Hence, 6(5) < 5(bc) = S(a), so that b Æ S by (+). A 
similar argument shows that c ¢ S. By the definition of S, both b and ¢ are 
the product of irreducibles and, hence, so is a = be. This contradicts the fact 


“We allow tha possibility of a product with just one factor in case the original element is itself 
irreducible. 
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Arithmetic in Integral Domains 


that a € S. Therefore, S is empty, and every nonzero nonunit element of R 
is the product of irreducibles. To show that this factorization is unique up 
to associates, copy the proof of Theorem 4.14, replacing constant by unit 
and Corollary 4.13 by Corollary 10.6. m 


E Exercises 


NOTE: Unless stated otherwise, R is an integral domain. 


A. 1. 
2. 


12. 


13. 


Show that Z[ Vd] is a subring of C. If d = 0, show that Z[Vd] is a subring of R. 


Let d # +1 be a square-free integer (that is, d has no integer divisors of the 
form e? except (+1)’). Prove that in Z[ Vd], r + sVd =r, + s Vd if and only 
if r = r, and s = s}. Give an example to show that this result may be false if d 
is not square-free. 


If the statement is true, prove it; if it is false, give a counterexample: 
(a) If a| band c| din R, then ac | bd. 
(b) If a |b and c |din R, then (a + c) | (b + d). 


. Prove that c and d are associates in R if and only if c |d andd| c. 
. If a = be with a £ 0 and b and c nonunits, show that a is not an associate of b. 


. Denote the statement “a is an associate of b” by a ~ b. Prove that ~ is an 


equivalence relation; that is, for all z, s, tE R: (i) r~ r. (ii) If r ~s, then s ~r. 
(iii) If r~s and s ~t, then r ~t. 


. Prove that every associate of an irreducible element is irreducible. 
. If u and v are units, prove that u and v are associates, 


. Show that the function 6 in Example 6 has property (ii) in the definition 


of a Euclidean domain in the case when b < 0. [Hint: Apply the Division 
Algorithm with a as dividend and |b] as divisor. Then modify the result.] 


. Is 2x + 2 irreducible in Z[x]? Why not? 
. Ifa=s+ tiandd = u + vi are in Zi] and b # 0, show that a/b = c + di, where 
su. + tv tu — su 
A T T 
(a) Show that Z is a Euclidean domain with the function 6 given by 5(n) = n’. 


(b) Is Q a Euclidean domain when 8 is defined by 5(r) = r”? 
Let R be a Euclidean domain with function 6 and let & be a positive integer. 


(a) Show that R is also a Euclidean domain under the function 6 given by 
Olr) = d(r) + k. 


(b) Show that R is also a Euclidean domain under the function $ given by 
Br) = kêr). 
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14. Let F be a field. Prove that Fis a Euclidean domain with the function ô given 
by 8(a) = 0 for each nonzero a E F. 


15. Let R be a Euclidean domain and we R. Prove that u is a unit if and only if 
5(u) = 5 (Ik). 

16. If dis the greatest common divisor of a and b in a Euclidean domain, prove 
that every associate of d is also a greatest common divisor of a and b. 


17. (a)Ifa =s + tiand b = u + viare nonzero elements of Z[i], show that 
&(ab) = 5(a)5(b), where 8(r + s) = P + s*. 


(b) If Ris a Euclidean domain, is it true that 5(ab) = 5(a)5(6) for all nonzero 
a, be R? 


18. Complete the proof of Corollary 10.4 by showing that an element d satisfying 
conditions (i) and (ii) is a greatest common divisor of a and b. 


19. Show that the elements q and r in the definition of a Euclidean domain are 
not necessarily unique. [Hint: In Z[i], leta = —4 + i and b = 5 + 3i; consider 
q=-landą= -1 +i. 


B. 20. If any two nonzero elements of R are associates, prove that R is a field. 


21. If every nonzero element of R is either irreducible or a unit, prove that R is a 
field. 


22. (a) Show that 1 + iis not aunit in Z[i]. [Hint: What is the inverse of 1 + iin C7] 
(b) Show that 2 is not irreducible in Z[i]. 


23. Let p be a nonzero, nonunit element of R such that whenever p | cd, then p|c 
or p | d. Prove that p is irreducible. 


24. If f:R > Sis a surjective homomorphism of integral domains, p is irreducible 
in R, and f(p) # Og, is f (p) irreducible in S? 


25. Let R be a Euclidean domain. Prove that 
(a) (1x) = (a) for all nonzero a E R. 
(b) If a and b are associates, then 5(a) = 5(b). 
(c) If a| b and 5(a) = (b), then a and b are associates. 
26. Show that Z[ V2] is a Euclidean domain with 5(r + s72) = r? + 23. 
27. Let w = (—1 + V—3)/2 and Zw] = {r + sw | r, s EZ}. Prove that Z[w] is 
a Euclidean domain with 5(r + sw) = (r + sw)(r + sa”) = r° — rs + 87, 
[Hint: Note that w° = 1 and w + w + 1 = 0 (Why?) ] 
28. Prove or disprove: Let R be a Euclidean domain; then 
I= {aER | 6(a) > 5(Ig)} is an ideal in R. 
29. Let R be a Euclidean domain. If the function ô is a constant function, prove 
that R isa field. 
30. (a) Prove that 1 — i is irreducible in Z[i]. [Hint: If a|(1 — i), then 1 — i= ab; 
see Exercises 17(a) and 25.] 


(b) Write 2 as a product of irreducibles in Z[i]. [Hint: Try 1 — i as a factor.] 
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C. 31. State and prove the Euclidean Algorithm for finding the gcd of two elements 
of a Euclidean domain. 
32. Let R be a Euclidean domain such that ô(a + b) = max {ô8(a), 5(5)} for all 
nonzero a, be R. Prove that g and r in the definition of Euclidean domain are 
unique. 


Principal Ideal Domains and Unique Factorization 
Domains 


A Euclidean domain is, in effect, a domain that has an analogue of the Division 
Algorithm. Consequently, all the proofs used for the integers and polynomial rings, 
most of which ultimately depended on the Division Algorithm, can be readily carried 
over to Euclidean domains. We now consider domains that may not have an analogue 
of the Division Algorithm but do have the other important arithmetic properties of Z, 
such as unique factorization and greatest common divisors. 


Definition A principal ideal domain (PID) is an integral domain in which every ideal 
is principal. 


The next theorem shows, for example, that Z, Q[x], and Z{ż] are all principal ideal 
domains because all of them are Euclidean domains (see Examples 5—7 of Section 10.1). 
Example 8 of Section 6.1 shows that the polynomial ring Z[x] is not a PID. 


Theorem 10.8 


Every Euclidean domain is a principal ideal domain. 


Proof» Suppose J is a nonzero ideal in a Euclidean domain R. Then the set 
{8(i) | i€ J} is a nonempty set of nonnegative integers, which contains a 
smallest element by the Well-Ordering Axiom. That is, there exists b € J 
such that 


(*) 5(6) = 6@ forevery el. 


We claim that J is the principal ideal (b) = {rb |r € R}. Since be JandI 
is an ideal, rb € I for every r E R; hence, (b) & I. Conversely, suppose c ET. 
Then there exist q, r E R such that 


c=bqtr and r=0, or 5(r) < 5). 


Since r = c — bq and both c and b are in J, we must have r E I. Hence, it is 
impossible to have 5(r) < 8(b) by (+). Consequently, r = Og ande = bg + 
r = bq E (b). Thus I& (b) and, hence, 7 = (b). Therefore, Risa PID. E 
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The converse of Theorem 10.8 is false: There are principal ideal domains that are 
not Euclidean domains (see Wilson and Williams [21]). Thus the class of Euclidean 
domains is strictly contained in the class of principal ideal domains. 

In our development of the integers, polynomial rings, and Euclidean domains we 
first considered greatest common divisors and used them to prove unique factoriza- 
tion. Although this approach could also be used with principal ideal domains, it is 
just as easy to proceed directly to unique factorization.* We begin by developing the 
connection between divisibility and principal ideals in any integral domain. 


Lemma 10.9 


Let a and b be elements of an integral domain R. Then 
(1) (a) = (b) if and only if b | a. 
(2) (a) = (b) if and only ifb | aanda | b. 
(3) (a) & (b) if and only if b | a and b is not an associate of a. 


Proof» (1) Note first that the principal ideal (b) consists of all multiples of b, 
that is, all elements divisible by b. Hence, 


ae(b) if and only if bla. 


Now if (a) & (b), then a is in the ideal (b), so that b | a. Conversely, if 
b | a, then a € (b), which implies that every multiple of a is also in the 
ideal (b). Hence, (a) & (b). 

(2) (a) = (b) if and only if (a) = (b) and (b) & (a). By (1), (a) & (b) and 
(5) & (a) if and only if b |a anda |b. 

(3) To prove this, use (1), (2), and Exercise 4 in Section 10.1, which 
shows that a | b and b |a if and only if b is an associate of a. WE 


To understand the origin of the next definition, it may help to recall the typical 
process for factoring an integer a, as a product of primes. Find a prime divisor p, of 
a, and factor: a, = pia. Next find a prime divisor p, of a, and factor: a, = p43, SO 
as a = P\p2a3. Now find a prime divisor p; of a; and factor again: a, = psa, and 

= P1P2p3a,. Continue in this manner. Since a, has only a finite number of prime 
Seine we must eventually have some a prime so that a = p, 1 and a = 
PiPoPx.** * Py* 1. The only way to continue factoring (with positive factors and with- 
out changing the p’s) is to use the fact that 1 = 1 - 1 repeatedly to write a, as 


= Pipopy- Per Lele. + 1. 


Now look at the same procedure from the point of view of ideals. We have a, | a), a; | a, 
a4|43,...,1|a,,1|1, 1] 1, and so on. Consequently, by Lemma 10.9 this factorization 
process leads to a chain of ideals 


(a1) S (a) 5 (a)S---S(a)S(M)e()e()e--- 


*Greatest common divisors are discussed at the end of this section; also see Exercises 20-22. 
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in which all the ideals are equal after some point. This suggests that factorization as 
a product of irreducibles is somehow related to chains of principal ideals in which all 
the ideals are equal after some point and motivates the following definition. 


Definition An integral domain R satisfies the ascending chain condition (ACC) on 
principal ideals provided that whenever (a;) & (a2) & (a3) S + > « then there 
exists a positive integer n such that (a,) = (an) for all /= n. 


Note that in this definition the identical ideals beginning with (@,) may not be the 
ideal (1p). Nevertheless, the preceding discussion suggests the possibility that Z has the 
ACC on principal ideals. This is indeed the case as we now prove. 


Lemma 10.10 


Every principal ideal domain A satisfies the ascending chain condition on 
principal ideals. 


Proof» 1f (a1) S (@) S - + “is an ascending chain of ideals in R, let A be the set- 
theoretic union Y (a). We claim that A is an ideal. Suppose a, b E4; 
then a €(a,) and be (a) for some j,k = 1. Either j sk or k = J, say j = k. 
Then (a,) & (az), so that a, b E (a,). Since (ag) is an ideal, we know that 
a — bE (a) SA and ra E (az) & A for any r E R. Therefore, A is an ideal by 
Theorem 6.1. Since R is a PID, A = (e) for some c E R. Since A = U (a), 


we know that c E (a,) for some n. Consequently, (c) S (@,) and for eich 
izn 


(an) & (a) sU (a) = A = (6) © (a). 
Therefore, (a, = (a,) foreachizn. E 


As we shall see, Lemma 10.10 is the key to showing that every nonzero nonunit 
element in a PID can be factored as a product of irreducibles. The fact that this fac- 
torization is essentially unique is a consequence of the next lemma. 


Lemma 10.11 


Let R be a principal ideal domain. If p is irreducible in R and p | bc, then p | b or 
pic. 
Proof* »1F P| be, then bc is in the ideal (p). If (p) were known to be a prime 


ideal, we could conclude that b€(p) or cE (p), that is, that p | b or p| c. 
Since every maximal ideal is prime by Corollary 6.16, we need only show 


*For an alternate proof using greatest common divisors in place of Corollary 6.16, see Exercise 23. 
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that (p) is a maximal ideal. Suppose J is any ideal with (p) & I= R. Since 
Risa PID, I = (d) for some de R. Then (p) & (d) = I implies that d | p. 
Since p is irreducible, d must be either a unit or an associate of p. If d 
is a unit, then J = (d) = R by Exercise 9 of Section 6.1. If dis an 
associate of p, say d = pu, then p | d and, hence, (d) & (p). In this case, 
(p) = (d) € (p), so that (p) = (d) = I. Therefore, (p) is maximal, and 
the proof is complete. W 


Theorem 10,12 


Let R be a principal ideal domain. Every nonzero, nonunit element of R is 
the product of irreducible elements,* and this factorization is unique up to 
associates: that is, if 


PxPo*** Pp = W19o*** Fs 


with each p, and q irreducible, then r = s and, after reordering and relabeling 
if necessary, 


p, is an associate of q, for/=1,2,...,F. 


Proof» Let a be a nonzero, nonunit element in R. We must show that a has at 
least one factorization. Suppose, on the contrary, that a is not a product 
of irreducibles. Then a is not itself irreducible. So a = a,b, for some 
nonunits a, and b, (otherwise every factorization of a would include a 
unit and a would be irreducible by Theorem 10.1). If both a, and 5, are 
products of irreducibles, then so is a. Thus at least one of them, say aj, is 
not a product of irreducibles. Since b; is not a unit, a, is not an associate 
of a (Exercise 5 in Section 10.1). Consequently, (a) G (a;) by part (3) of 
Lemma 10.9. 

Now repeat the preceding argument with a, in place of a. This leads 
to a nonzero nonunit a such that (a,) Ẹ (a) and az is not a product of 
irreducibles. Continuing this process indefinitely would lead to a strictly 
ascending chain of principal ideals (a,) Ẹ (a,) G (a) Ẹ - + +, contradict- 
ing Lemma 10.10. Therefore, a must have at least one factorization as a 
product of irreducibles, 

Now we must show that this factorization is unique up to associates. 
To do this, adapt the proof of Theorem 4.14 (the case when R = F[x]) 
to the general situation by replacing the word constant by unit and using 
Lemma 10.11 and Exercise 2 in place of Corollary 4.13. E 


To appreciate the importance of Theorem 10.12, it may be beneficial to examine a 
domain in which unique factorization fails. 


“We allow the possibility of a product with just one factor in case the original element is itself 
irreducible. 
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EXAMPLE 1 


Let Q,[x] denote the set of polynomials with rational coefficients and integer 


: l 2 1 1 
constant terms. For instance, x, 7“ and 2 are in Q,[x], but X+ 2 and 4 are 


not. Verify that Q,[x] is an integral domain and that the constant polynomial 2 
is irreducible in Q,[x] (Exercise 16). The irreducible element 2 is a factor of 


x€Q,[x] because x = 2- (>) Similarly, 2 is an irreducible factor of Lx 


because > =2- G) Hence, x=2.2. G} In fact, the process of 


factoring out irreducible 2’s never ends because 


In view of this, it should not be surprising that x cannot be factored as a prod- 
uct of irreducibles of Q [x] (Exercise 17). 

Compare this situation with the prime factorization of a, in Z as described on 
page 333. In Z the factorization becomes trivial after a finite number of steps (the 
only remaining factors are 1’s), and all the ideals in the corresponding chain are equal 
after that point. In the factorization (+) in Q7[x], however, things are different. The 
remaining factors each time a 2 is factored from x are the elements 


No two of these elements are associates (Exercise 3) and each element is 2 times 
the following one, that is, each element is divisible by the following one. Therefore, 
by part (3) of Lemma 10.9 


os(})s(i)s(h)s--- 


Hence, the ACC for principal ideals does not hold in Q3[x]. 


Unique Factorization Domains 


In our study of Euclidean domains and principal ideal domains, the main result was 
that unique factorization held. Now we reverse the process and consider domains in 
which unique factorization always holds to see what other properties from ordinary 
arithmetic they may have. 
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Definition An integral domain R is a unique factorization domain (UFD) provided 

that every nonzero, nonunit element of R is the product of irreducible 

elements,* and this factorization is unique up to associates; that is, if 
PsP" ** Pr = ho gs 


with each p, and q irreducible, then r = s and, after reordering and relabel- 
ing if necessary, 


pı is an associate of q;forj/=1,2,...,6 
EXAMPLE 2 


Theorem 10.12 shows that every PID is a unique factorization domain. In 
particular, the ring Z[i] of Gaussian integers is a UFD. 


EXAMPLE 3 


As noted in Example 1, Q,[x] is not a unique factorization domain because the 
element x has no factorization as a product of a finite number of irreducibles. In 
Section 10.3 we shall see that Z[~/—5] fails to be a UFD for a different reason: 
Every element is a product of irreducibles, but this factorization is not unique. 


EXAMPLE 4 


A proof that the polynomial ring Z[x] is a UFD is given in Section 10.5. Since 
Z[x] is not a principal ideal domain (see Example 8 of Section 6.1), we see that 
the class of all unique factorization domains is strictly larger than the class of 

all principal ideal domains. 


NOTE: The remainder of this section is optional and is not needed for the sequel. 


When working with two integers, you can always arrange things so that the same 
primes appear in the factorizations of both elements. For instance, consider the prime 
factorizations —18 = 2 « 3 »(—3) and 40 = 2 + (--2) + (—2) » 5. The list of all primes that 
appear in both factorizations is 2, 3, —3, 2, —2, ~2, 5, but several of these primes are 
associates of each other. By eliminating any prime on the list that is an associate of an 
earlier number on thelist we obtain thelist 2, 3, 5in which no two numbers are associ- 
ates. We can write both 18 and 40 as products of these three primes and the units +1: 


—18 =2+3-(-3)=(-1)+2-3-3 =(-1)-2°-3?- 5 
40 = 2+ (—2)+(—2)+5 =(-1(-1)-2+2+2-5=(1)-23+3°: 5! 


Essentially the same procedure works in any UFD. 


“We allow the possibility of a product with just one factor in case the original element is itself irreducible. 
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Theorem 10,13 


If c and d are nonzero elements in a unique factorization domain R, then 
there exist units u and v and irreducibles p,, po, . . . ı Py NO two of which are 
associates, such that 


C= Upy™py™ ++ p™ and d= ppr" py”, 
where each m,and n, is a nonnegative integer. Furthermore, 
eld  ifandonlyif msn, foreach /f=1,2,...,4 


In the example preceding the theorem, with c= —18 and d = 40, we had u = —1, v= 1, 
Pi = 2, p2 = 3, and p; = 5. 


Proof of Theorem 10.18 > since R is a UFD, both ¢ and d can be factored, say 
€=992°°"q,andd =r, ++ r, with each q, and r, irreducible. In the list 
qis Fas «+ +5 Qs Fts Py». - , r, delete any element that has an associate appear- 
ing earlier on the list and denote the remaining elements by pi, po,..., 
Py- Then each p,is irreducible, no two of them are associates of each other, 
and each one of the q’s and r's is an associate of some p, Consequently, in 
the factorization ¢ = qq, * * * q, each q, is of the form wp, with w a unit. 
By rearranging terms, ¢ can be written (product of units) (product of p’s). 
The product of these units is itself a unit, call it u. By rearranging the p’s 
in this product and inserting other p’s with zero exponents if necessary, 
we can write c = up;"'p7"? ++ > pg™, with each m; = 0. A similar procedure 
works for d and proves the first part of the theorem. 

To prove the first half of the last statement of the theorem, suppose 
c |d. Then d= cb for some b E R. Since the irreducible p, appears exactly 
n, times in the factorization of d, it must also appear exactly n; times in the 
factorization of cb. But p; already appears 7, times in the factorization of c 
and may possibly appear in the factorization of b, so we must have m, = n,. 
Conversely, suppose that m; = n; for every i. Verify that d = ca, where 


a= (u'u) (p mp rm. . » De® ™), 


Therefore, c|d. M 


Corollary 10.14 


Every unique factorization domain satisfies the ascending chain condition on 
principal ideals. 


Proof» First, suppose (c) and (d) are principal ideals ina UFD R such that 
(d) Ẹ (c). Then c| dand cis not an associate of d by Lemma 10.9. If ¢ and 
d are written in the form given by Theorem 10.13, then each m; = n,. If 
m, = n, for every i, then c = uv ~'d, which means that c is an associate of 
d, acontradiction. Hence, there must be some index j for which my <M 
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Suppose (a) & (@) S (a) S » + * is a chain of principal ideals in R. 
Lemma 10.9 shows that each a; divides a,. By Theorem 10.13 we 
may assume that a, = yp,"p;"+ + » p,"* and that each q is of the form 
a, = upp,” + * * py, where the p; are nonassociate irreducibles, If 
there are just a finite number of strict inclusions () in the chain of 
ideals, then there are only equalities after a certain point and the ACC 
holds. There cannot be an infinite number of strict inclusions because 
the first paragraph shows that each time a strict inclusion occurs, one 
of the exponents on one of the p’s must decrease. Consequently, after 
a finite number of strict inclusions, there would be an a, of the form 
a, = upil > + © = p, = u. Thus a, is a unit, which implies that (a,) = R by 
Exercise 9 of Section 6.1. For each i = n we have (a,) & (a) & R = (a,), so 
that (a,) = (a;). Therefore, R satisfies the ACC on principal ideals. W 


Irreducibles in a unique factorization domain have a property that we have 
used frequently in the special cases of Euclidean domains and principal ideal 
domains. 


Theorem 10,15 


Let p be an irreducible element in a unique factorization domain A. If p| be, 
then p]b or p|c. 


Proof» if b or cis Og, then there is nothing to prove because p | Og. If cis a unit 
and p| be, then pt = be for some tE R and pte™! = b. Hence, p | b; simi- 
larly, if b is a unit, then p | c. If both b and c are nonzero nonunits, then 
b=4,'°* qande = qg, * * * q, with the q; (not necessarily distinct) 
irreducibles. Since p | bc, we have pr = be = q; >> * q, for some r E R. The 
irreducible p must be an associate of some q, by unique factorization. 
Therefore, p divides q; and, hence, divides b or c. W 


We are now in a position to characterize unique factorization domains. 


Theorem 10.16 


An integral domain R is a unique factorization domain if and only if 


(1) R has the ascending chain condition on principal ideals; and 
(2) whenever p is irreducible in R and p|cd, then p|c or pid. 


As the proof of the theorem shows, condition (1) corresponds to the existence of 
an irreducible factorization for each nonzero nonunit element and condition (2), to 
the uniqueness of this factorization. The two conditions are independent: (1) fails and 
(2) holds in Q,[x] (see Example 1 and Exercise 33), whereas (1) holds and (2) fails in 
2|V—5] (as we shall see in Example 4 and Exercise 21 of Section 10.3). 
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Proof of Theorem 10.16 » if Ris a UFD, then R satisfies (1) and (2) by Corollary 10.14 
and Theorem 10.15. Conversely, assume R satisfies (1) and (2) and let a 
be a nonzero nonunit element of R. The argument used in the proof of 
Theorem 10.12, which depends only on the ACC, is valid here and shows 
that a can be factored as a product of irreducibles. To show that this 
factorization is unique, adapt the proof of Theorem 4.14 (the case when 
R = F[x] to the general situation by replacing the word constant by unit 
and using (2) and Exercise 2 in place of Corollary 4.13, @ 


Greatest Common Divisors 


Greatest common divisors were a useful tool in our study of Z, F[x], and other Euclidean 
domains. In each case the gcd of two elements was defined to be a common divisor of 
“largest size,” where size was measured by absolute value in Z, by polynomial degree 
in F[x], and by the function 6 in an arbitrary Euclidean domain. Unfortunately, there 
may be no similar way to measure “size” in an arbitrary integral domain, so greatest 
common divisors must be defined in terms of divisibility properties alone: 


Definition Let a, a, ..., &n be elements (not all zero) of an integral domain R. A 
greatest common divisor of 4,, a)... , 2, is an element d of R such that 
(i) d divides each of the a; 


{ii) ife ER and c divides each of the a, then cd. 


Corollaries 1.3, 4.9, and 10.4 show that this definition is equivalent to the definitions 
used previously in Z, F[x], and other Euclidean domains. The only difference is that great- 
est common divisors in Z and F[x], are no longer unique (see the discussion on page 326). 


Theorem 10,17 


Let d be a greatest common divisor of a), a, ..., & Ìn an integral domain R. 
Then 

(1) Every associate of d is also a gcd of a;,...,@) 

(2) Any two greatest common divisors of a, . . . , a, are associates. 


Proof (1) Exercise 7. 


(2) Suppose both dand ¢ are gcd’s of aj, . . . , an. Then ż divides each 

a, and, therefore, t | d by (ii) in the definition of the greatest common 
divisor d. But d also divides each a, and, hence, d| t by (ii) in the defini- 
tion of the ged t. Since ¢ | d and d | t, we know that d and t are associates 
by Exercise 4 of Section 10.1. W 


WARNING: In some integral domains a finite set of elements may not 


have a greatest common divisor (see Exercise 13 in Section 10.3). 
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Theorem 10,18 


Leta;, a, . . » , an (not all zero) be elements in a unique factorization domain R. 
Then a1,...,@, have a greatest common divisor in R. 


Proof» The gcd of any set of elements is the gcd of the nonzero members of the 
set, so we may assume that each a, is nonzero. By Theorem 10.13 there are 
irreducibles p}, .. ., P; (no two of which are associates), units uj, » >. , Up 
and nonnegative integers my such that 


ay = yp pps < Dy 
m, = ppr pr" > + p,”™ 


. 


Gy = WPi Pr Ps * + + D™, 


Let kı be the smallest exponent that appears on p,; that is, k, is the 
minimum of M41, Ma1, Majs « «5 Mai: Similarly, let k, be the smallest 
exponent that appears on pz, and so on. Use Theorem 10.13 to verify 
that d = pp, ... pisa godof a,...,a, E 


In an arbitrary unique factorization domain, it may not be possible to write the 
gcd of elements a and > as a linear combination of a and b as it was in Z and F[x]. In 
Section 10.5, for example, we shall see that 1 is a gcd of the polynomials x and 2 in 
the UFD Z[x], but 1 is not a linear combination of x and 2 in Z[x] (Exercise 6). In a 
principal ideal domain, however, the ged of a and b can always be written as a linear 
combination of a and b (Exercise 20), 


E Exercises 


A. 1. If a, b are nonzero elements of an integral domain and a is a nonunit, prove 
that (ab) Ç (b). 
2. Suppose p is an irreducible element in an integral domain R such that whenever 
p | bc, then p |b or p|c. If p | ajaz- > - a,, prove that p divides at least one a,. 
3. (a) Prove that the only units in Q,[x] are 1 and —1. [Hint: Theorem 4.2.] 
(b) If f(x) € Q,[x], show that its only associates are f(x) and —f(x). 
4. Isa field a UFD? 


5. Give an example to show that a subdomain of a unique factorization domain 
need not be a UFD. 


6. Prove that 1 is not a linear combination of the polynomials 2 and x in Z[x], that 
is, prove it is impossible to find f(x), g(x) Z[x] such that 2 f(x) + xg(x) = 1. 
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a 


14. 


15. 
16. 


17. 


Arithmetic in Integral Domains 


Let d be a gcd of a}, - . . , a, in an integral domain. Prove that every associate 
of dis also a god of ay, ... , ay. 


. Let p be an irreducible element in an integral domain. Prove that lp is a gcd of 


panda if and only if p ¥ a. 


. Let R be a PID. If (c) is a nonzero ideal in R, then show that there are only 


finitely many ideals in R that contain (c). [Hint: Consider the divisors of e.) 


. Prove that an ideal (p) in a PID is maximal if and only if p is irreducible. 
. Prove that every ideal in a principal ideal domain R (except R itself ) is 


contained in a maximal ideal. [Hint: Exercise 10.] 


. Prove that an ideal in a PID is prime if and only if it is maximal. 


[Hint: Exercise 10.] 


. Let fR — S be a surjective homomorphism of rings with identity. 


(a) If R is a PID, prove that every ideal in S is principal. 
(b) Show by example that S need not be an integral domain. 


Let p be a fixed prime integer and let R be the set of all rational numbers that 
can be written in the form a/b with b not divisible by p. Prove that 


(a) R is an integral domain containing Z. [Note n = 2/1]. 
(b) If a/be Rand p + a, then a/b is a unit in R. 
(c) If Tis a nonzero ideal in R and J + R, then J contains p‘ for some t > 0. 


(d) Risa PID. (If I is an ideal, show that I= (p*), where p* is the smallest 
power of p in Z.) 


Let I be a nonzero ideal in Z[i]. Show that the quotient ring Z[#]// is finite. 


(a) If pis prime in Z, prove that the constant polynomial p is irreducible in 
Q,[x]. [Hint: Theorem 4.2 and Exercise 3.] 


(b) If p and q are positive primes in Z with p # q, prove that p and q are not 
associates in Q,[x]. 


(a) Show that the only divisors of x in Q,[x] are the integers (constant poly- 
nomials) and first-degree polynomials of the form n* with 0 + neZ. 

(b) For each nonzero n EZ, show that the polynomial L is not irreducible 
in Qz[x]. [Hint: Theorem 10.1.] 


(c) Show that x cannot be written as a finite product of irreducible elements 


in Q,[x]. 


. A ring R is said to satisfy the ascending chain condition (ACC) on ideals if 


whenever I; © hS hS: isa chain of ideals in R (not necessarily principal 
ideals), then there is an integer n such that J, = J, for all j = n. Prove that if 
every ideal in a commutative ring R is finitely generated, then R satisfies the 
ACC. [Hint: See Theorem 6.3 and adapt the proof of Lemma 10.10.] 
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19. 


20. 


21. 


22. 
. Give an alternative proof of Lemma 10.11 as follows. If p | b, there is nothing to 


25. 


26. 


27. 


29. 
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A ring R is said to satisfy the descending chain condition (DCC) on ideals if 
whenever J, 2 , 2 J,>°* + is achain of ideals in R, then there is an integer n 
such that J = J, for all j =n. 


(a) Show that Z does not satisfy the DCC. 


(b) Show that an integral domain R is a field if and only if R satisfies the 
DCC. [Hint: If 0 + aE Ris not aunit, what can be said about the chain 
of ideals (a) 2 (a 2(a*)2--+ 9] 

Let R be a PID and a, bE R, not both zero. Prove that a, b have a greatest 

common divisor that can be written as a linear combination of a and b. 

[Hint: Let I be the ideal generated by a and b (see Theorem 6.3); then J = (d) 

for some dE R. Show that dis a gcd of a and b.] 


Let R be a PID and S an integral domain that contains R. Let a, b, dE R. 
If dis a gcd of a and b in R, prove that dis a gcd of a and b in S. 
[Hint: See Exercise 20.] 


Extend Exercise 20 to any finite number of elements. 


prove. If p ¥ b, then lp is a gcd of p and b by Exercise 8. Now show that p | e by 
copying the proof of Theorem 1.4 with p in place of a and Exercise 20 in place 
of Theorem 1.2. 


. Let R be an integral domain. Prove that R is a PID if and only if (i) every 


ideal of R is finitely generated (Theorem 6.3) and (ii) whenever a, bE R, the 
sum ideal (a) + (b) is principal. [Sum is defined in Exercise 20 of Section 6.1.] 


Let R be an integral domain in which any two elements (not both 0g) have 


a gcd. Let (r, s) denote any gcd of r and s. Use ~ to denote associates as in 
Exercise 6 of Section 10.1. Prove that for all r, s, tE R: 


(a) If s~ t, then rs ~ rt. 

(b) If s~ t, then (r, s) ~ (r, t). 

(c) r{s, t) ~ (rs, rt). 

(d) (r, (s, 2)) ~ ((r, £), £). [Hint: Show that both are gcd’s of r, s, t.] 

Let R be an integral domain in which any two elements (not both 0z) have a 
gcd. With the notation of Exercise 25, prove that if (b, c)~ lg and (b, d)~ lg, 
then (b, cd) ~ 1p. [Hint: By Exercise 25(a) and (c), d ~ (bd, cd), so that 

lp ~ (b, d) ~ (b, (bd, cd)). Apply parts (d), (c), and (a) of Exercise 25 to show 
that (b, (bd, cd)) ~ (b, cd). 

Let R be an integral domain in which any two elements (not both zero) have a 


gcd. Let p be an irreducible element of R. Prove that whenever p | cd, then p | c€ 
or p | d. [Hint: Exercises 8 and 26.] 


. If Ris a UFD, if a, b, and c are elements such that a | c and b | c, and if lpisa 


gcd of a and b, prove that ab | c. 
Let R be a UFD. If a | bc and if 1p is a gcd of a and b, prove that a | e. 
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30. A least common multiple (1cm) of the nonzero elements a,,..., a is an 
element b such that (i) each a, divides b and (ii) if each a, divides an element c, 
then b | c. Prove that any finite set of nonzero elements in a UFD has a least 
common multiple. 

31. Prove that nonzero elements a and b in R have a least common multiple if and 
only if the intersection of the principal ideals (a) and (b) is also a principal ideal. 


C. 32. Prove that every ideal Zin VAAL is finitely generated (Theorem 6.3) as 
follows. Let Jy = IN Z and let h = {hEZ|a + bVd EI for some a EZ}. 


(a) Prove that I and J, are ideals in Z. Therefore, I = (ro) and J, = (r;) for 
some r,EZ. 


(b) Prove that ho 1. 


(c) By the definition of A, there exists a, E Z such that a, + nV isin Z. Prove 
that Jis the ideal generated by rọ and a, + rva. [Hint If r + sVa El, 
then s EJ so that s = r,s,. Show that (r + Vd) — s(a + nVa) E l; use 
this to write r + sV'd as a linear combination of ro and a, + nV] 


33. Prove that p(x) is irreducible in Q,[x] if and only if p(x) is either a prime 
integer or an irreducible polynomial in Q[x] with constant term +1. 
Conclude that every irreducible p(x) in Q[x] has the property that 
whenever p(x) | e(x)d(x), then p(x) | e(x) or p(x) | d(x). 


34. Show that every nonzero f(x) in Q,[x] can be written in the form 
cx"p,(x) ++ * px), with cE Q, n = 0, and each p,(x) nonconstant irreducible 
in Q,[x] and that this factorization is unique in the following sense: If f(x) = 
dx"q,(x) «++ g{x) with dE Q, m = 0, and each q,(x) nonconstant irreducible 
in Q,[x], then ¢ = +d, m =n, k = t, and, after relabeling if necessary, each 
Px) = q(x). 

35. Prove that any two nonzero polynomials in Q,[x] have a gcd. 


36. (a) Prove that f(x) is irreducible in Z[x] if and only if f(x) is either a prime 
integer or an irreducible polynomial in Q[x] such that the gcd in Z of the 
coefficients of f(x) is 1. 


(b) Prove that Z[x] is a UFD. [Hint: See Theorems 4.14 and 4.23.] 


103 Factorization of Quadratic Integers* 


In this section we take a closer look at the domains Z[ Vd]. Because unique factoriza- 
tion frequently fails in these domains, they provide a simplified model of the kinds of 
difficulties that played a crucial role in the historical origin of the concept of an ideal. 
These domains also illustrate how ideals can be used to “restore” unique factorization 
in some domains that lack it. We begin with a brief sketch of the relevant history. 


*The prerequisites for this section are pages 322-324 of Section 10.1 and the definition of unique 
factorization domain (page 337). 
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Early in the last century, Gauss proved the “Law of Biquadratic Reciprocity,” 
which provides a fast way of determining whether or not a congruence of the form 
x* = c (mod n) has a solution. Although the statement of this theorem involves only 
integers, Gauss’s proof was set in the larger domain Z[i]. He proved and used the fact 
that Z[/] is a unique factorization domain. 

Since Gauss’s proof involved £[i] and 7 is a complex fourth root of 1, the German 
mathematician E. Kummer thought that analogous theorems for congruences of 
degree p might involve unique factorization in the domain. 


Ziw] = {æ + aw + aw? + -++ + a, yw? "| a EZ}, 


where w = cos(27/p) + i sin (277/p) is a complex pth root of 1. He was unable to 
develop higher-order reciprocity theorems because he discovered that Z[w] may not 
bea UFD.* 

Later in the century questions about unique factorization arose in connection 
with the following problem. It is easy to find many nonzero integer solutions of the 
equation x” + y? = 2°, such as 3, 4, 5, or 5, 12, 13. But no one has ever found nonzero 
integer solutions for X + y? = z or x4 + yf = 2+, which suggests that 


x" + y" = 2" has no nonzero integer solutions when n > 2. 


This statement is known as Fermat's Last Theorem because in the late 1630s Fermat 
wrote it in the margin of his copy of Diophantus’ Arithmetica and added “I have 
discovered a truly remarkable proof, but the margin is too small to contain it.” Fermat’s 
“proof” has never been found. Most mathematicians today doubt that he actually had 
a valid one. 

In 1847 the French mathematician G. Lame thought he had found a proof of 
Fermat’s Last Theorem in the case when z is prime.‘ His proof used the fact that for 
any odd positive prime p, x? + y? can be factored in the domain Z[w] described above: 


xP + y? = (x + pix + wyl(x + wy) +++ (x + wP y). 


Lame’s purported proof depended on the assumption that Z[w] is a unique factoriza- 
tion domain. When he became aware of Kummer’s work, he realized that his proof 
could not be carried through. 

Kummer had already found a way to avoid the difficulty. He invented what he 
called “ideal numbers” and proved that unique factorization does hold for these ideal 
numbers. This work eventually led to a proof that Fermat’s Theorem is true for a large 
class of primes, including almost all the primes less than 100. This was a remark- 
able breakthrough and deeply influenced later work on the problem.‘ But it had even 
greater significance in the development of modern algebra. For Kummer’s “ideal num- 
bers” were what we now call ideals. 

We shall return to ideals at the end of the section. Now we consider factorization 
in the domains Z[Vd]. These domains are similar to the ones that Kummer used and 


*The domain Z[w] is a UFD for every prime p less than 23 and fails to be a UFD for every larger prime. 
tif the theorem is true for prime exponents, then it is true for all exponents; see Exercise 1. 


§Fermat's Last Theorem was finally proved in 1994 by Andrew Wiles. His proof uses results and 
techniques not available until relatively recently. 
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illustrate in simplified form the problems he faced and his method of solution. We 
shall assume that the integer dis square-free, meaning that d # 1 and d has no integer 
factors of the form c? except (+1)*. The following function is the key to factorization 
in Z[V 4d]. 


Definition The function N: Z[ Vd] — Z given by 
Ns + td) = (s + tVd\(s—tvd) = 2-a? 


is called the norm. 


For example, in Z[V3], 
N5 + 2V3) = 59 -3:2 =13 and M2 —4V3) = 2 — 3(-4)? = —44. 
Note that 


when d < 0, the norm of every element is nonnegative. 
For instance, in Z[V—S], 
Ms + tV—-5) = $- (5P = + 5P=0. 


In Example 7 of Section 10.1, we saw that the norm makes Z[i] = Z[V—1] into a 
Euclidean domain. This is not true in general, but we do have 


Theorem 10,19 


If d is a square-free integer, then for all a, b € Z[Vd] 


(1) N(a) = 0 if and only if a = 0. 
(2} N(ab) = N(a)N(6). 


Proof» (1) If a = s + Vā, then N(a) = 2 — de’ so that N(a) = 0 if and only if 
£ = dt. If d= —1, then ê = —¢ can occur in Z if and only if s = 0 = £, 
that is, if and only if a = 0. So suppose d —1. Every prime in the 
factorization of s* and # must occur an even number of times. But the 
prime factors of d do not repeat because d is square-free. So if pis a prime 
factor of d, it must occur an odd number of times in the factorization 
of df’. By unique factorization in Z, the equation 3 = df is impossible 
unless s = 0 = t, that is, unless a = 0. 

(2) Leta =r + sVd and b = m + nV. The proof is astraightfor- 
ward computation (Exercise 3). @ 
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Theorem 10,20 


Let d be a square-free integer. Then u €Z[‘Vd] is a unit if and only if Mu) = + 


Proof 1f wis a unit, then wv = 1 for some v€ Z[Vd ]. By Theorem 10.19, 
N(u)N(v) = N(uv) = M(1) = 1? — d+ 0? = 1. Since M(x) and N(v) 
are integers, the only possibilities are N(u) = +1 and Mv) = +1. 
Conversely, if u = s + tVd and N(u) = +1, let u = s — tVdeZ[Va]. 
Then by the definition of the norm, wu = N(u) = +1. Hence, 
u(tu) = landwisaunit. E 


EXAMPLE 1 


In Z[V2] the element 3 + 2V2 is a unit because N(3 + 2V2) = 

3? — 2+ 2?= 1. Verify that the inverse of 3 + 2V2is 3 — 2V2. Every 
power of a unitis also a unit, so Z[V2] has infinitely many units, including 
(3 + 2V2), (3 + 2V2)?,(3 + 2V/72)3,. 


According to Theorem 10.20 we can determine every unit s + tVd in Z[V/d] by 
finding all the integer solutions (for $ and #) of the equations s* — d? = +1. When 
d > 1, these equations have infinitely many solutions (see the preceding example and 
Burton [12]. When d = —1, the equations reduce to 3 + # = 1.* The only integer 
solutions are s = +1, t = 0, and s = 0, t = +1, So the only units in Z[i] = 7[V—1] are 
+1 and +i. If d< —1, say d = —k with k > 1, then the equations reduce to $ê + k? = 1.* 
Since k > 1, the only integer solutions are s = +1, £ = 0. Thus we have 


Corollary 10.21 


Let d be a square-free integer. If d > 1, then Z[ Vd] has infinitely many units. 
The units in Z[V— 1] are +1 and +i. Ifd < —1, then the units in Z[ Vd] are +1. 


Corollary 10.22 


Let d be a square-free integer. If p € Z[Vd] and Nip) is a prime integer in Z, 
then p is irreducible in Z[ Vd]. 


Proof» since N(p) is prime, N(p) + +1, so pis not a unit in Z[Vd] by 
Theorem 10.20. If p = ab in Z[Vd], then by Theorem 10.19, N(p) = 
N(a@)N(b) in Z. Since N(a), N(b), N(p) are integers and N(p) is prime, 
we must have M(a) = +1 or Mb) = +1. So aor bis a unit by Theorem 
10.20. Therefore, p is irreducible by Theorem 10.1. & 


*Since the left side of the equation is always nonnegative, —1 cannot be on the right side. 
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EXAMPLE 2 


Theelement 1 — /is irreducible in Z[i] because N — V —1) = 2. Similarly, 1 + i 
is also irreducible. Therefore, a factorization of 2 as a product of irreducibles in 
Z[i] is given by 2 = (1 + 0 — Ò. 


The converse of Corollary 10.22 is false. For instance, in Z[VV—5] the norm of 
1 + V—5 is 6, which is not prime in Z. But the next example shows that 1 + V—5 is 
irreducible in Z[V—5]. 


EXAMPLE 3 


To show that 1 + \—5 is irreducible in Z[V—5], suppose 1 + /—5 = ab. By 
Theorem 10.1 we need only show that a or b is a unit. By Theorem 10.19, 
N(a)N(b) = Mab) = N(1 + V—5) = 6. Since N(a) and N(b) are nonnegative 
integers, the only possibilities are N(a) = 1, 2, 3, or 6. If a= s + tV—5 and 
N(a) = 2, then s* + 57? = 2. It is easy to see that this equation has no integer 
solutions for s and #; so N(a) = 2 is impossible. A similar argument shows 
that M(a) = 3 is impossible. If N(a) = 1, then ais a unit by Theorem 10.20. If 
N(a) = 6, then N(b) = 1 and b is a unit. Therefore, 1 + V—S is irreducible. 


We have seen an example of an integral domain in which a nonzero, nonunit element 
could not be factored as a product of irreducibles (Exercise 17 in Section 10.2). We shall 
now see that Z[‘Vd] may fail to be a UFD for a different reason: Although factorization 
as a product of irreducibles is always possible in #[ Vd], it may not be unique. 


Theorem 10,23 


Let d be a square-free integer. Then every nonzero, nonunit element in Z[Vd] 
is a product of irreducible elements.* 


Proof» Let S be the set of all nonzero, nonunits in Z[Vd] that are not the product 
of irreducibles. We must show that S is empty. So suppose, on the con- 
trary, that S is nonempty. Then the set W = {| N(t)||tES} isa nonempty 
set of positive integers. By the Well-Ordering Axiom, W contains a small- 
est integer. Thus there is an element a E S such that | N(a) |= | M(d | for 
every t E S. Since a E S we know that a is not itself irreducible. So there 
exist nonunits b, c € Z[V/d] such that a = be. At least one of b, c must 
bein S (otherwise a would be a product of irreducibles and, hence, not 
in 5), say b E S. Since b and c are nonunits, |N(b) | > 1 and| M(c)|> 1 by 
Theorem 10.20. But| N(a) | = | M(B) || N(c)| by Theorem 10.19, so we must 
have 1 <|N(b)|<|N(@)| But b E S, so | Ma) |= | Nb) | by the choice of a. 
This is a contraction. Therefore, S is empty, and the theorem is proved. E 


* As usual, we allow a “product” with just one factor. 
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EXAMPLE 4 


The domain Z[V— 5] is not a unique factorization domain. The element 6 in 
Z[V—5] has two factorizations: 

6=2:3 and 6=(1+ V—5\(1 — V-75). 
The proof that 1 + \/—5 is irreducible was given in Example 3 The proofs that 2, 3, 
and 1 — V—S areirreducible are similar. Forinstance, if 2 = ab, then Ma)N(b) = 
N(ab) = N2) = 4 so that Ma) = 1, 2, or 4. But Ma) = 2 is impossible because the 
equation s* + 5/2 = 2 has no integer solutions, So either Ma) = 1 and a is a unit, 
or Na) = 4. In the latter case MD) = 1 and b is a unit. Therefore, 2 is irreducible by 
Theorem 10.1. Since the only units in Z[/— 5] are +1, it is clear that neither 2 nor 3 
is an associate of 1 + V—S or 1 — V—5. Thus the factorization of 6 as a product of 
irreducibles is not unique up to associates and 2[-V—5] is nota UFD. 


The preceding example demonstrates that the irreducible 2 divides the product 
(1 + V-54 — V-5 in Z[V—5] but does not divide either 1 + V—5 or 1-— V—5. 
So when unique factorization fails, an irreducible element p may not have the property 
that when p |ed, then p | c or p | d.* Another consequence of the failure of unique fac- 
torization is the possible absence of greatest common divisors (Exercise 13). 


Unique Factorization of Ideals 


We are now in the position that Kummer was in a century and a half ago and the 
question is: How can some kind of unique factorization be restored in domains such 
as Z[V—5]? Kummer’s answer was to change the focus from elements to ideals.t The 
product 7J of ideals J and J is defined to be the set of all sums of elements of the form 
ab, with a e T and bc J; that is, 


IJ = {a,b, + ab, +e- anaba \ n 21, a, EL, bed}. 
Exercise 36 in Section 6.1 shows that JJ is an ideal. Instead of factoring an element a 


as a product of irreducibles, Kummer factored the principal ideal (a) as a product of 
prime ideals. 


EXAMPLE 5 


We shall express the principal ideal (6) in Z[V—5] as a product of prime ideals. 
The irreducible factorization of elements 6 = 2 + 3 seems a natural place to start, 
and it is easy to prove that the ideal (6) is the product ideal (2)(3) (Exercise 16). 
But (2) is not a prime ideal (for instance, the product (1 + V—5) (1 — V—5) = 6 
is in (2) but neither of the factors is in (2)). So we must look elsewhere. Let P be 
the ideal in Z[V—5] generated by 2 and 1 + V—5, that is, 


P = {2a + (1+ V—5)b|a, be ZV- SI}. 


*This is not particularly surprising in view of Theorem 10.16. 
tKummer used different terminology, but the ideas here are essentially his. We use the modern 
terminology of ideals that was introduced by R. Dedekind, who generalized Kummer's theory. 
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Then P is an ideal by Theorem 6.3. Exercise 17 shows that r + sV—5e Pif and 
only if r and s are both even or both odd. This implies that the only distinct 
cosets in Z[/—5]/P are 0 + P and 1 + P, as we now see: If m + uV—5 

has m odd and n even, then (m + nV—5) — 1 = (m — 1) + nV—SEP because 
m — 1 and n are even. Hence, (m + n/—5) + P = 1 + P. Similarly, if m is 
even and n is odd, then (m — 1) + #V/—Se P because m — 1 and n are odd. It 
follows that the quotient ring Z[V—5]/P is isomorphic to Z,. Therefore, P is 

a prime ideal in Z[/—5] by Theorem 6.14. A similar argument (Exercise 19) 
shows that Q, and Q, are prime ideals, where 


Q, = {3a + (1 + V—5)b| a, b E Z[V—5}]}, 
Q = {3a + (1 — V—5)b| a, bE Z[V—5}}. 
Exercises 18 and 19 show that the product ideal P? = PP is precisely the 


ideal (2) and that Q,Q, = (3). Therefore, the ideal (6) is a product of four 
prime ideals: (6) = (2)(3) = P?Q,0). 


Kummer went on to show that in the domains he was considering, the factorization 
of an ideal as a product of prime ideals is unique except for the order of the factors. 
This result was later generalized by R. Dedekind. In order to state this generalization 
precisely, we need to fill in some background. 

An algebraic number is a complex number that is the root of some monic polyno- 
mial with rational coefficients. If ¢ is an algebraic number and fis the root of a poly- 
nomial degree n in Q[x], then 


Q(t) = {ay + at + agi? +--+ +a, 42""|a,EQ} 


is a subfield of C and every element in Q(#) is an algebraic number.* An algebraic 
integer is a complex number that is the root of some monic polynomial with integer 
coefficients. It can be shown that the set of all algebraic integers in Q(t) is an integral 
domain. If w is a complex root of x? — 1, then the domain Z[w] that Kummer used 
is in fact the domain of all algebraic integers in Q(w) (see Ireland and Rosen [13; 
page 199]). So Kummer’s results are a special case of 


Theorem 10.24 


Let £ be an algebraic number and R the domain of all algebraic integers in 
Q(t). Then every ideal in R (except 0 and A) is the product of prime ideals 
and this factorization is unique up to the order of the factors. 


For a proof see Ireland and Rosen [13; page 174]. 

Most of the rings Z[Vd] are also special cases of Theorem 10.24. Forif dis a square- 
free integer, then tf = Vd is an algebraic number (because it is a root of X? — d) and 
Q(Vd) = {a + a; Vd | a,€ Q}. The algebraic integers in the field Q(V/d) are called 


*For a proof see Theorems 11.7 and 11.9. 
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quadratic integers. Every element r + sVd of Z[Vd] is a quadratic integer in Q( Vd) 
because it is a root of this monic polynomial in Z[x]: 


x? — 2rx + (P — ds’) = (x — (r + svd- (t - sd). 


When d = 2 or 3 (mod 4), then Z[V/d] is the domain R of all quadratic integers in 
Q(Vd), but when d = 1 (mod 4), there are quadratic integers in R that are not in 
Z[V4d] (see Exercise 22).* 

Theorem 10.24 has proved very useful in algebraic number theory. But it does not 
answer many questions about unique factorization of elements, such as: If R is the 
domain of all quadratic integers in Q(Vå), for what values of dis Ra UFD? When 
d<0, Risa UFD if and only if d= —1, —2, —3, —7, —11, —19, —43, —67, or —163 
(see Stark [19]). When d > 0, Ris known to be a UFD for d = 2, 3, 5, 6, 7, 11, 13, 17, 
19, 21, 22, 23, 29, and many other values. But there is no complete list as there is when 
dis negative. It is conjectured that R is a UFD for infinitely many values of d. 


E Exercises 


A. 1. If x + y* = # has no nonzero integer solutions and k |n, then show that 
x + y® = 2" has no nonzero integer solutions. 


2. Let w be a complex number such that œ = 1. Show that 
Zo] = {a9 + aw + ago? +--+ +a, oF? '|a,EZ} 
is an integral domain. [Hint: oP = 1 implies w*! = w, wt? = w’, etc.] 
. If a = r + sVd and b = m + nVdin Z[Vd], show that N(ab) = N(a)N(b). 
. Explain why Z[V-— 5] is not a Euclidean domain for any function 6. 


NS WwW 


. If aE Q is an algebraic integer, as defined on page 350, show that ae Z. 
[Hint: Theorem 4.21.] 


B. 6. In which of these domains is 5 an irreducible element? 
(a) Z (b) Zi] (e) AV —2] 


7. In Z[V—7], factor 8 as a product of two irreducible elements and as a product 
of three irreducible elements. [Hint: Consider (1 + V—7)( — V—7).] 


8. Factor each of the elements below as a product of irreducibles in Z[i], [Hint: 
Any factor of a must have norm dividing N(a).] 


()3 7 (@4+3 @ 47 


9. (a) Verify that each of 5 + V2, 2 — V32, 11 — 7V2, and 2 + V2 is irreducible 
in Z[V2]. 


*Since dis square-free, d # 0 (mod 4). 
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(b) Explain why the fact that 
(5 + V2X2 — V2) = (11 — 7V2)(2 + V2) 


does not contradict unique factorization in Z[V2]. 
10. Find two different factorizations of 9 as a product of irreducibles in Z[V—5]. 
11. Show that Z[V—6] is not a UFD. [Hint: Factor 10 in two ways.] 
12. Show that Z[V10] is not a UFD. [Hint: Factor 6 in two ways.] 


13. Show that 6 and 2 + 2/—5 have no greatest common divisor in Z[W— 5]. 
[Hint: A common divisor a of 6 and 2 + 2\/—5 must have norm dividing 
both M(6) = 36 and N(2 + 2\/—5) = 24; hence, a = r + sW—5 with 7 + 
5s* = N(a) = 1, 2, 3, 4, 6, or 12. Use this to find the common divisors. Verify 
that none of them is divisible by all the others, as required of a gcd. Also see 
Example 4.] 


14. Show that | isa gcd of 2and 1 + V—5 in Z[V—5), but 1 cannot be written in 
the form 2a + (1 + V—5)é witha, bE Z[V—5]. 


15. Prove that every principal ideal in a UFD is a product of prime ideals 
uniquely except for the order of the factors. 


16. Show that (6) = (2)(3) in Z[V —5]. (The product of ideals is defined on page 349.) 
17. Let P be the ideal {2a + (1 + V—5)b|a, bE Z[V—5)} in Z[V—5]. Prove that 


r+sV—5SeP if and only if r = s (mod 2) (that is, r and s are both even or 
both odd). 


18. Let P be as in Exercise 17. Prove that P’ is the principal ideal (2). 
19. Let Q, be the ideal {3a + (1 + V—5)b|a, be Z[V-5]} and Q; the ideal 
{3a + (1 — V—5)b|a, bE Z[V—5}} in Z[V—5}. 
(a) Prove that r + sW—Se Q, if and only if r = s (mod 3). 
(b) Show that Z[/—5]/Q; has exactly three distinct cosets. 
(c) Prove that Z[V—5]/Q, is isomorphic to Z3; conclude that Q, is a prime ideal. 
(d) Prove that Q; is a prime ideal. [Hint: Adapt (a)—(c).] 
(e) Prove that Q,Q, = (3). 


20. If r+ sV -—SEZ[V—S] with s + 0, then prove that 2 is not in the principal 
ideal (r + sV/—5). 


21. If dis a square-free integer, prove that Z[ Vd] satisfies the ascending chain 
condition on principal ideals. 


C.22. Let d be a square-free integer and let Q(Vd) be as defined on page 350. We 
know that Z[Vd] € Q(Vd) and every element of Z[Vd] is a quadratic integer. 
Determine all the quadratic integers in Q(Vđ) as follows. 


(a) Show that every element of Q(Vd) is of the form (r + s Vd)/t, where 
r,s, t EZ and the gcd (r, s, £) of r, s, t is 1. Hereafter, let a = (r + sVd)jt 
denote such an arbitrary element of ava). 
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(b) Show that a is a root of 
p(x) = x2 - (=). + (= ze eo 


[Hint: Show that p(x) = (x — a)(x — a), where a = (r — sVd)/t.] 
(c) Ifs # 0, show that p(x) is irreducible in Q[x]. 


(d) Prove that a is a quadratic integer if and only if p(x) has integer 
coefficients. [Hint: If s # 0, use Exercise 5; if s + 0 and a is a root of a 
monic polynomial f(x) € Z[x], use Theorem 4.23 to show that æ is a root 
of some monic g(x) € Z[x], with g(x) irreducible in Q[x]. Apply (c) and 
Theorem 4.14 to show g(x) = p(x).] 


(e) If ais a quadratic integer, show that ¢|2r and ¢*|4ds?. Use this fact to prove 
that ¢ must be 1 or 2. [Hint: d is square-free, (r, s t) = 1; use (b) and (d).] 


(f) If d=2 or 3 (mod 4), show that ais a quadratic integer if and only if 
t = 1. [Hint: If t = 2, then r? = d? (mod 4) by (b) and (d). If s is even, 
reach a contradiction to the fact that (r, s, t) = 1; if s is odd, use Exercise 7 
of Section 2.1 to get a contradiction.] 


(g) If d = 1 (mod 4) and æ € Q( Vd), show that a is a quadratic integer if and 
only if ¢ = 1, or t = 2 and bothr and s are odd. [Hint: Use (d).] 
(h) Use (f) and (g) to showthat the set of all quadratic integers in Q( Vd) is Z[Vd] 
fmt 
if d= 2 or 3 (mod 4) and here 


if d= 1 (mod 4). 


|m, n, © Zand m = n(mod 2) 


104 The Field of Quotients of an Integral Domain* 


For any integral domain R we shall construct a field F that contains R and consists of 
“quotients” of elements of R. When the domain R is Z, then F will be the field Q of 
rational numbers. So you may view these proceedings either as a rigorous formaliza- 
tion of the construction of Q from Z or as a generalization of this construction to 
arbitrary integral domains. The field F will be the essential tool for studying factoriza- 
tion in R[x] in Section 10.5. 

Our past experience with rational numbers will serve as a guide for the formal 
development. But all the proofs will be independent of any prior knowledge of the 


rationals. 
A rational number a/b is determined by the pair of integers a, b (with b # 0). But 
3 ; ; : $ 1 4 
different pairs may determine the same rational number; for instance, > = A = x and 


in general 


ga č ; ‘ 
ame if and only if ad = be. 


*This section is independent of the rest of Chapter 10. Its prerequisites are Chapter 3 and Appendix D. 
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This suggests that the rationals come from some kind of equivalence relation on pairs 
of integers (equivalent pairs determine the same rational number). We now formalize 
this idea. 

Let R be an integral domain and let S be this set of pairs: 


S = {(a, b) |a, bE Rand b # Og}. 
Define a relation ~ on the set S by 


(a, b) ~ (e, d) means ad= bcin R. 


Theorem 10.25 


The relation ~ is an equivalence relation on S. 


Proof » Reflexive: Since r is commutative ab = ba, so that (a, b) ~ (a, b) for every 
pair (a, b) in S. Symmetric: If (a, b) ~ (e, d), then ad = bc. By commutativ- 
ity cb = da, so that (c, d) ~ (a, b). Transitive: Suppose that (a, b) ~ (c, d) 
and (c, d) ~ (r, s). Then ad = be and cs = dr. Multiplying ad = be by s and 
using cs = dr we have ads = (bc)s = b(cs) = bdr. Since d Op by the defini- 
tion of S and R is an integral domain we can cancel d from ads = bdr and 
conclude that as = br. Therefore, (a, b) ~ (r,s). m 


The equivalence relation ~ partitions S into disjoint equivalence classes by Corollary D.2 
in Appendix D. For convenience we shall denote the equivalence class of (a, b) by [a, b] rather 
than the more cumbersome [(a, )]. Let F denote the set of all equivalence classes under ~. 
Note that by Theorem D1, 


[a, b] = [c, d] in F if and only if (a, b) ~ (c, d) in S. 
Therefore, by the definition of ~, 
[a, b] = [c, d] in F if and only if ad = bc in R. 


We want to make the set F into a field. Addition and multiplication of equivalence 
classes are defined by 
[a, b] + [c, d] = [ad + be, bd] 
[a, ble, d] = fac, bd).* 


In order for this definition to make sense, we must first show that the quantities on 
the right side of the equal sign are actually elements of the set F. Now [a, b] is the 


*These definitions are motivated by the arithmetical rules for rational numbers (just replace the 
fraction r{s by the equivalence class [f, s]): 
a c ad+bc a c æ 
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equivalence class of the pair (a, b) in S. By the definition of S we have b # Og; simi- 
larly, d # Og. Since R is an integral domain, bd + Og. Thus (ad + be, bd) and (ac, bd) 
are in the set S, so that the equivalence classes [ad + be, bd] and [ac, bd] are elements 
of F. But more is required in order to guarantee that addition and multiplication in 


F are well defined. 
la og! 3 
In ordinary arithmetic, oe o —and replacing 5 A by — $ produces the same answer 
43 12 3 
because — 3 a = AD = io The answer doesn’t depend on how the fractions are repre- 


sented. Similarly, in F we must show that arithmetic does not depend on the way the 
equivalence classes are written: 


Lemma 10.26 


Addition and multiplication in F are independent of the choice of equivalence 
class representatives. In other words, if [a, b] = [a’, 6’] and [c, d) = [c’, d'], then 
[ad + be, bd] = [a’d’ + b'c’, b’a"] 


and 
[ac, bd] = [a’c’, b’d"]. 
Proof» As noted above [ad + be, bd] = [a'd’ + b'c', b'd']in Fif and only if 


(ad + bc)b'd' = bda'd' + b'c’) in R. So we shall prove this last state- 
ment. Since [a, b] = [a’, b’] and [c, d] = [c’, d'] we know that 


(+) ab! = ba and cd’ =de. 
Multiplying the first equation by dd’ and the second by bb’ and adding 
the results show that 

ab'dd' = ba'dd' 

cd'bb’ = de'bb' 


ab'dd' + cd'bb’ = ba'dd' + dc'bb' 
(ad + be)b'd' = bd(a'd' + b'c’). 
Therefore, [ad + be, bd] = [a’d’ + b'c’, b'd']. 


For the second part of the proof multiply the first equation in (+) by 
cd’ and the second by 4a’ so that 


ab' cd’ = ba'cd’ and cedbæ = deba. 


By commutativity the right side of the first equation is the same as the 
left side of the second equation so that the other sides of the two equa- 
tions are equal: ab’cd’ = dc’ ba’. Consequently, 


(ac)(b'd’) = ab'ed’ = de'ba’ = (bd)\(a'c'). 


The two ends of this equation show that [ac, bd] = [a'c', b’d’]. m 
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Lemma 10.27 


If Ris an integral domain and F is as above, then for all nonzero a, b, c, d, k ER: 
(1) [0g, 6] = [0p, d]; 
(2) [a, 6] = [ak, bk]; 
(8) [a, a] = [c, c]. 


Proof» Exercise 1. w 


Lemma 10.28 


With the addition and multiplication defined above, F is a field. 


Proof» Closure of addition and multiplication follows from Lemma 10.26 and 
the remarks preceding it. Addition is commutative in F because addition 
and multiplication in R are commutative: 


[a, b] + [c, d] = [ad + be, bd] = [cb + da, db] = [c, d] + [a, b]. 


Let 0p be the equivalence class [Og, b] for any nonzero bE R (by (1) in 
Lemma 10.27 ail pairs of the form (Og, b) with b # Op are in the same 
equivalence class). If [a, b] E F, then by (2) in Lemma 10.27 (with k = b): 


[a, b] + Op = [a, b] + [0g, b] = [ab + bOr, bb] = [ab, bb] = [a, b). 


Therefore, Oris the zero element of F. The negative of [a, b] in Fis [—a, b] 
because 


[a, b] + [—a, b] = (ab — ba, b°] = [0p, b? = Or- 


The proofs that addition is associative and that multiplication is associa- 
tive and commutative are left to the reader (Exercise 2), as is the verifica- 
tion that [1p, 1g] is the multiplicative identity element in F. If [a, b] isa 
nonzero element of F, then a # Op. Hence, [b, a] is a well-defined element 
of F and by (3) in Lemma 10.27 


[a, B][b, a] = [ab, ba] = [1zab, lpab] = [1p, lz]. 


Therefore, [b, a] is the multiplicative inverse of [a, b]. To see that the dis- 
tributive law holds in F, note that 


[a, bI, d] + [r D= [a, blies + dr, ds] 


= [a(cs + dr), b(ds)] 
= [aes + adr, bds]. 
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On the other hand, by (2) in Lemma 10.27 (with k = b) 
[a, blic, d] + [a, blir, s] = [ac, bd] + [ar, bs] 
= [(ac)(bs) + (bd)ar), (bd)(bs)] 
= [(acs + adr)b, (bds)b] 
= [acs + adr, bds]. 


Therefore, [a, byle, d] + [r, s]) = [a, blic, d) + [a, blir, J]. m 


We usually identify the integers with rational numbers of the form a/1. The same 
idea works in the general case: 


Lemma 10.29 


Let R be an integral domain and F the field of Lemma 10.28. Then the subset 
R* = {[a, 12] |a ER} of F is an integral domain that is isomorphic to R. 


Proof» Verify that R* is a subring of F (Exercise 3). Clearly [1p, 1g], the identity 
element of F, is in R*, so R* is an integral domain. Define a map 
FR — R* by f(a) = [a, 1g}. Then f is a homomorphism: 
fla) + Ke) = [a, 1r] + [c, ln] = [alg + ize, lglg] 
= [a + c, le] = fla + c) 
Fo) = [a, llle, 1a] = fac, 1p] = f(ac). 

If f(a) = f(c), then [a, 1p] = [c, lg], which implies that alg = lge by the 
boldface statement following Theorem 10.25. Thus a = c and fis injec- 
tive. Since f is obviously surjective, f is an isomorphism. W 


The equivalence class notation for elements of Fis awkward and doesn’t convey the 
promised idea of “quotients”. This is easily remedied by a change of notation, Instead 
of denoting the equivalence class of (a, b) by [a, 5], 

denote the equivalence class of (a, 5) by a/b. 
If we translate various statements above from the brackets notation to the new quotient 
notation, things begin to look quite familiar: 


Theorem 10,30 


Let R be an integral domain. Then there exists a field F whose elements are of 
the form afb with a, b eR and b # Qp, subject to the equality condition 


= = F inf ifandonlyif ad= bcinR. 
Addition and multiplication in F are given by 
E MAC i RL E EE, 
b d bd `b d bd 
The set of elements in F of the form afi (a ER) is an integral domain isomor- 


phic to R. 
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Proof» Lemmas 10.28 and 10.29 and the notation change preceding the 
theorem.* W 


It is now clear that if R = Z, then the field F is precisely Q. So Theorem 10.30 may 
be taken as a formal construction of Q from Z. In the general case, we shall follow the 
same custom we use with Q: The ring R will be identified with its isomorphic copy in 
F. Then we can say that R is the subset of F consisting of elements of the form a/l,. 
The field F is called the field of quotients of R. 


EXAMPLE 1 


Let F be a field. The field of quotients of the polynomial domain F[x] is 
denoted by F(x) and consists of all f(x)/g(x), where f(x), g(x) € F[x] and g(x) + 
0x. The field F(x) is called the field of rational functions over F. 


The field of quotients of an integral domain R is the smallest field that contains R 
in the following sense.* 


Theorem 10.31 


Let R be an integral domain and F its field of quotients. If K is a field containing 
R, then K contains a subfield E such that R e E & K and E is isomorphic to F. 


Proof» 1f ajb EF, then a, b E Rand bis nonzero. Since RE K, b~! exists. Define a 
map f:F — K by f (a/b) = ab`". Exercise 9 shows that fis well defined, that 
is, a/b = c/d in F implies f(a/b) = f(c/d) in K. Exercise 10 shows that f is 
an injective homomorphism. If E is the image of F under f, then F = E. 
For eachaE R, a = aly") = f(aj/lh)E€ E, so REGES K. m 


E Exercises 


NOTE: Unless noted otherwise, R is an integral domain ard F its field of quotients. 
A. 1. Prove Lemma 10.27. 
2. Complete the proof of Lemma 10.28 by showing that 
(a) Addition of equivalence classes is associative. 
(b) Multiplication of equivalence classes is associative. 
(c) Multiplication of equivalence classes is commutative. 
3. Show that R* = {[a, lp] | æ € R} is a subring of F. 
*At this point you may well ask, “Why didn't we adopt the quotient notation sooner?” The reason is 
psychological rather than mathematical. The quotient notation makes things look so much like the 
familiar rationals that there is a tendency to assume everything works like it always did, instead of 


actually carrying out the formal (and tiresome) details of the rigorous development. 
1Theorem 10.31 is not used in the sequel. 
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. If Ris itself a field, show that R = F. 
. If R = Zi], then show that F = {r + sil r, seQ}. 
. If R = Z[ Vd], then show that F = {r + sVd |r, s €Q). 


. Show that there are infinitely many integral domains R such that Z & REQ, 
each of which has Q as its field of quotients. [Hinz: Exercise 28 in 
Section 3.1.] 


8. Let fR — R, be an isomorphism of integral domains. Let F be the field of 
quotients of Rand F; the field of quotients of R,. Prove that the map 
S*:F > F, given by f*(a/b) = f(a)/f(6) is an isomorphism. 


9. If Ris contained ina field Kand a/b = c/d in F, show that ab"! = cd in K. 
[Hint: a/b = c/d implies ad = bc in K] 


10. (a) Prove that the map fin the proof of Theorem 10.31 is injective. 
[Hint: f(a/b) = f(c/d) implies ab-! = cd-'; show that ad = be.] 


(b) Use a straightforward calculation to show that fis a homomorphism. 


> oe. eS 


11. Leta, be R. Assume thereare positive integers m, n such that a” = b", a" = 
b", and (m, n) = 1. Prove that a = b. [Remember that negative powers of a and 
b are not necessarily defined in R, but they do make sense in the field F for 
instance, a? = Ip/a*] 


12. Let R be an integral domain of characteristic 0 (see Exercises 41-43 in 
Section 3.2). 


(a) Prove that R has a subring isomorphic to Z [Hinz: Consider {nlp |nEZ}.] 


(b) Prove that a field of characteristic 0 contains a subfield isomorphic to Q. 
[Hint: Theorem 10.31] 


13. Prove that Theorem 10.30 is valid when R is a commutative ring with no 
zero divisors (not necessarily an integral domain). [Hint: Show that for any 
nonzero a€ R, the class [a, a] acts as a multiplicative identity for F and the set 
{[ra, a] | re R} isasubring of F that is isomorphic to R. The even integers are 
a good model of this situation] 


105 Unique Factorization in Polynomial Domains* 


Throughout this section R is a unique factorization domain. We shall prove that the 
polynomial ring R[x] is also a UFD. The basic idea of the proof is quite simple: Given 
a polynomial f(x), factor it repeatedly as a product of polynomials of lower degree 
until f(x) is written as a product of irreducibles. To prove uniqueness, consider f(x) as 


*The prerequisites for this section are pages 322-324 of Section 10.1, the definition of unique 
factorization domain (together with Theorems 10.13, 10.15, and 10.18), and Section 10.4. Theorems 10.13, 
10.15, and 10.18 depend only on the definition of UFD and may be read independently of the rest of 
Section 10.2. 
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a polynomial in F [x], where F is the field of quotients of R. Use the fact that F[x] is 
a UFD (Theorem 4.14) to show that factorization in R[x] is unique. There are some 
difficulties, however, in carrying out this program. 


EXAMPLE 1 


The polynomial 3x? + 6 cannot be factored as a product of two polynomials of 
lower degree in Z[x] and is irreducible in Q[x]. But 3x? + 6 is reducible in Z[x] 
because 3x? + 6 = 3(x? + 2) and neither 3 nor x? + 2 is a unit in Z[x]. 


So the first step is to examine the role of constant polynomials in R[x]. By 
Corollary 4.5 and Exercise 1 


the units in |x] are the units in R 
and 


the irreducible constant polynomials in R[x| are 
the irreducible elements of R. 


For example, the units of Z[x] are +1. The constant polynomial 3 is irreducible in Z[x] 
even though it is a unit in Q[x]. 

The constant irreducible factors of a polynomial in R[x] may be found by factoring 
out any constants and expressing them as products of irreducible elements in R. 


EXAMPLE 2 


In Z[x], 
6x? + 18x + 12 = 6(x? + 3x + 2) = 2+ 3(x? + 3x + 2). 


Note that x? + 3x + 2 is a polynomial whose only constant divisors in Z[x] are 
the units +1. This example suggests a strategy for the general case. 


Let R be a unique factorization domain. A nonzero polynomial in R[x] is said to be 
primitive if the only constants that divide it are the units in R. For instance, x* + 3x + 
2and 3x* ~ 5x3 + 2x are primitive in Z[x]. Primitive polynomials of degree 0 are units. 
Every primitive polynomial of degree 1 must be irreducible by Theorem 10.1 (because 
every factorization includes a constant (Theorem 4.2) and every such constant must be 
a unit). However, primitive polynomials of higher degree need not be irreducible (such 
asx? + 3x + 2 = (x + 1)(x + 2) in Z[x)). On the other hand, an irreducible polynomial 
of positive degree has no constant divisors except units by Theorems 4.2 and 18.1. So 


an irreducible polynomial of positive degree is primitive. 
Furthermore, as the example illustrates, 


every nonzero polynomial f(x) E€ R]x]| 
factors as f(x) = ¢g(x) with g(x) primitive. 
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To prove this claim, let c be a greatest common divisor of the coefficients of f(x).* Then 
f(x) = cgx) for some g(x). Now we show that g(x) is primitive. If de R divides g(x), 
then g(x) = dh(x) so that x) = cdh(x). Since ed is a constant divisor of f(x), it must 
divide the coefficients of f(x) and, hence, must divide the gcd c. Thus cdu = c for some 
u ER. Since c # Og we see that du = lp and dis a unit. Therefore, g(x) is primitive. 

Using these facts about primitive polynomials, we can now modify the argument 
given at the beginning of the section and prove the first of the two conditions neces- 
sary for R[x] to bea UFD. 


Theorem 10,32 


Let R be a unique factorization domain. Then every nonzero, nonunit f(x) in 
R[x] is a product of irreducible polynomials.t 


Proof» Let f(x) = cg(x) with g(x) primitive. Since R is a UFD c is either a unit 
or a product of irreducible elements in R (and, hence, in R[x]). So we 
need to prove only that g(x) is either a unit or a product of irreducibles 
in R[x]. If g(x) is a unit or is itself irreducible, there is nothing to prove. 
If not, then by Theorem 10.1 g(x) = A(x)k(x) with neither h(x) or k(x) 

a unit. Since g(x) is primitive, its only divisors of degree 0 are units, so 
we must have 0 < deg h(x) < deg g(x) and 0 < deg k(x) < deg g(x). 
Furthermore, A(x) and k(x) are primitive (any constant that divides one 
of them must divide g(x) and hence be a unit). If they are irreducible, 
we're done. If not, we can repeat the preceding argument and factor 
them as products of primitive polynomials of lower degree, and so on. 
This process must stop after a finite number of steps because the degrees 
of the factors get smaller at each stage and every primitive polynomial 
of degree 1 is irreducible. So g(x) is a product of irreduciblesin R[x]. E 


The proof that factorization in R[x] is unique depends on several technical facts 
that will be developed next. But to get an idea of how all the pieces fit together, you 
may want to read the proof of Theorem 10.38 now, referring to the intermediate re- 
sults as needed and accepting them without proof. Then you can return to this point 
and read the proofs, knowing where the argument is headed. 


Lemma 10,33 


Let R be a unique factorization domain and g(x), A(x) € R[x]. If p is an irreduc- 
ible element of R that divides g{x)h(x), then p divides g(x) or p divides A(x). 


Proof > Copy the proof of Lemma 4.22, which is the special case R = Z. Just 
replace Z by R and prime by irreducible and use Theorem 10.15 in place 
of Theorem 1.5. E 


*The gcd c exists by Theorem 10.18. 
tAs usual we allow a “product” with just one factor. 
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Corollary 10.34 Gauss’s Lemma 


Let R be a unique factorization domain. Then the product of primitive 
polynomials in A[x] is primitive. 


Proof» xf g(x) and A(x) are primitive and g(x)A(x) is not, then g(x)h(x) is 
divisible by some nonunit c € R. Consequently, each irreducible factor p 
of c divides g(x)h(x). By Lemma 10.33, p divides g(x) or h(x), contradict- 
ing the fact that they are primitive. Therefore, g(x)h(x) is primitive. W 


Theorem 10.35 


Let R be a unique factorization domain and r, s nonzero elements of A. Let f(x) 
and g(x) be primitive polynomials in R[x] such that rf(x) = sg(x). Then r and s 
are associates in R and f(x) and g(x) are associates in R[x]. 


Proof» tf ris a unit, then f(x) = r-'sg(x). Since r-'s divides the primitive 
polynomial f(x), it must be a unit, say (7's) = 1g. Hence, f(x) and g(x) 
are associates in R[x]. Furthermore, u is a unit in R and su = r so that r 
and s are associates in R. 

If r is a nonunit, then r = p,p, - - - Pg with each p, irreducible. Then 
PiP2+ + + Py f(x) = sg(x), so p; divides sg(x). By Lemma 10.33 p, divides 
sor g(x). Since p; is a nonunit and g(x) is primitive, p, must divide s, say 
s = pıt. Then pp2- + + pe f(x) = sg(x) = pitg(x). Canceling p, shows 
that pz + - - pf (x) = tg(x). Repeating the argument with p, shows that 

- « Def (x) = zg(x), where pz = t and, hence, p,p.z = pıt = s. After 
k such steps we have f(x) = wg(x) and s = pP; - - - pew for some wE R. 
Since w divides the primitive polynomial f(x), w is a unit. Therefore, 
f(x) and g(x) are associates in R[x]. Since s = p, + - + pew = rv, r and s 
are associates in R. W 


Corollary 10.36 


Let R be a unique factorization domain and F its field of quotients. Let f(x), 

g(x) be primitive polynomials in R[x]. If f(x) and g(x) are associates in F[x], 

then they are associates in R[x]. 

Proof» xf Sx) ang a(x) are associates in F[x], then g(x) = — S) for some 
nonzero ~ eF by Corollary 4.5. Consequently, A = rf(x) in Rix]. 
Therefore, f(x) and g(x) are associates in R[x] by Theorem 10.35. m 


Corollary 10.37 


Let R be a unique factorization domain and F its field of quotients. If f(x) € Rix] 
has positive degree and is irreducible in A[x], then f(x) is irreducible in F[x]. 
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Proof» 1f f(x) is not irreducible in F[x], then f(x) = g(x)A(x) for some g(x), A(x) 
E F[x] with positive degree. Let b be a least common denominator of the 
coefficients of g(x). Then bg(x) has coefficients in R. So bg(x) = ag;(x) with 


aE Rand g,(x) primitive of positive degree in R[x]. Hence, g(x) = za. 

Similarly (x) = s h(x) with c dE Rand A(x) primitive of positive degree 
a c ac 

in Rix]. Therefore, f(x) = KOAA) = FEO AO = Fah), 


so that bdf(x) = acg,(x)h,(x) in R[x]. Now f(x) is primitive because it is 
irreducible and g,(x)h,(x) is primitive by Corollary 10.34. So dd is an as- 
sociate of ac by Theorem 10.35, say bdu = ac for some unit we R. 
Therefore, f(x) = POE) = = ug,(x)hy(x). Since ug,(x) and A,(x) are 


polynomials of -a degree in R[x], this contradicts the irreducibility 
of f(x). Therefore, f(x) must be irreducible in F[x]. E 


Theorem 10.38 


If R is a unique factorization domain, then so is R[x]. 


Proof» Every nonzero nonunit f(x) in R[x] isa product of irreducibles by 
Theorem 10.32. Any such factorization consists of irreducible constants 
(that is, irreducibles in R) and irreducible polynomials of positive degree. 
Suppose 


** Empi (x) * * * Pex) = dye + + dagl) + + + gd) 


with each c, d, irreducible in R and each p{x), gfx) irreducible of posi- 
tive degree in R[x] (and, hence, primitive).* Then p;(x) - > * p(x) and 
q(x) * + * gfx) are primitive by Corollary10.34. So Theorem 10.35 shows 
that cı + + + Cm is an associate of d, + ++ d in R and p(x) - + + p(x) is an 
associate of g,(x) + - - q(x) in R[x]. Hence, c1* + + Em = udda +++ d, for 
some unit u€ R. Associates of irreducibles are irreducible (Exercise 7 of 
Section 10.1), so wd, is irreducible. Since R is a UFD, we must have m = n 
and (after relabeling if necessary) c, isan associate of ud, (and hence of 
d,), and c, is an associate of d,for i= 2. Let F be the field of quotients 
of R. Each of the pfx), q(x) is irreducible in F[x] by Corollary 10.37. 
Unique factorization in F[x] (Theorem 4.14) and an argument simi- 

lar to the one just given for R show that & = ¢ and (after relabeling if 
necessary) each p{x) is an associate of g{x) in F[x]. Consequently, p{x) 
and q,(x) are associates in R[x] by Corollary 10.36. Therefore, R[x] is a 
UFD. E 


*It may be that neither factorization contains constants, but this doesn't affect the argument Itis not 
possible to have irreducible constants in one factorization but not in the other (Exercise 5). 
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An immediate consequence of Theorems 1.8 and 10.38 and Example 8 of 
Section 6.1 is 


Corollary 10.39 


2[x] is a unique factorization domain that is not a principal ideal domain. 


As illustrated in the preceding discussion, theorems about Z[x] and Q[x] are quite 
likely to carry over to an arbitrary UFD and its field of quotients. Among such results 
are the Rational Root Test and Eisenstein’s Criterion (Exercises 9-11). 


E Exercises 


NOTE: Unless stated otherwise R is a UFD and F its field of quotients. 


A. 1. Let R be any integral domain and p E R. Prove that p is irreducible in R if and 
only if the constant polynomial p is irreducible in R[x]. [Hint: Corollary 4.5 
may be helpful.] 


2. Give an example of polynomials f(x), g(x) € R[x] such that f(x) and g(x) are 
associates in F[x] but not in R[x]. Does this contradict Corollary10.36? 

3. If ci *** Ca f) = g(x) with ¢,€ Rand g(x) primitive in R[x], prove that each 
cis a unit. 

4. If g(x) is primitive in R[x], prove that every nonconstant polynomial in R[x] 
that divides g(x) is also primitive, 

B. 5. Prove that a polynomial is primitive if and only if 1, is a greatest common 
divisor of its coefficients. This property is often taken as the definition of 
primitive. 

6. If f(x) is primitive in R[x] and irreducible in F[x], prove that f(x) is irreducible 
in R[x]. 
7. If Ris aring such that R[x] isa UFD, prove that R is a UFD. 
8. If Ris aring such that R[x] is a principal ideal domain, prove that R is a field. 
9. Verify that the Rational Root Test (Theorem 4.21) is valid with Z and Q 
teplaced by R and F. 
10. Verify that Theorem 4.23 is valid with Z and Q replaced by Rand F. 


11. Verify that Eisenstein’s Criterion (Theorem 4.24) is valid with Z and Q 
replaced by R and F and prime replaced by irreducible. 


12. Show that xX? — 6x? + 4ix + 1 + 3iis irreducible in Z[i Diq]. 
[Hint: Exercise 11.] 
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CHAPTER Í] 


Field Extensions 


High-school algebra deals primarily with the three fields Q, R, and C and plane 
geometry, with the set R x R. Calculus is concerned with functions from R to R. 
Indeed, most classical mathematics is set in the field C and its subfields. Other 
fields play an equally important role in more recent mathematics. They are used in 
analysis, algebraic geometry, and parts of number theory, for example, and have 
numerous applications, including coding theory and algebraic cryptography. 

In this chapter we develop the basic facts about fields that are needed to prove 
some famous results in the theory of equations (Chapter 12) and to study some of 
the topics listed above. The principal theme is the relationship of a field with its 
various subfields. 


ma Vector Spaces 


An essential tool for the study of fields is the concept of a vector space, which is 
introduced in this section. Vector spaces are treated in detail in books and courses 
on linear algebra. Here we peesent only those topics that are needed for our study of 
fields. If you have had a course in linear algebra, you can probably skip most of this 
section. Nevertheless, it would be a good idea to review the main results, particularly 
Theorems 11.4 and 11.5. 

Consider the additive abelian group* M(R) of all 2 X 2 matrices over the field R 


b 
of real numbers. If ris a real number and A = (? i} is an element of M(R), then the 


*Except for the last two results in the chapter, group theory is not a prerequisite for this chapter. In 
this section you need only know that an additive abelian group is a set with an addition operation 
that satisfies Axioms 1-5 in the definition of a ring (page 44). 
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product of the number r and the matrix A is defined to be the matrix r4 = (= wr! 


This operation, which is called scalar multiplication, takes a real number (field ele- 
ment) and a matrix (group element) and produces another matrix (group element). 
This is an example of a more general concept. Let F be a field and G an additive abe- 
lian group.* Then a scalar multiplication is an operation such that for each a € F and 
each v E G there is a unique element av E G. 


Definition Let F be a field. A vector space over F is an additive abelian gřoup* V 
equipped with a scalar multiplication such that tor all a, au a& € F and v, 
Vir VWE V: 
(i) a(Vy + Vo) = av; + avz 
(ii) (@ + a)y = ayy + awi 
(jit) alaa) = (ayag)v; 
(iv) te =v. 


EXAMPLE 1 


Scalar multiplication in M(R), as defined above, makes M(R) into a vector 
space over R (Exercise 1). 


EXAMPLE 2 


Consider the set Q? = Q x Q, where Q is the field of rational numbers. Then 
Q? is a group under addition (Theorem 3.1 or 7.4); its zero element is (0, 0) and 
the negative of (s, #) is(—s, —#). For ac Q and (s, f) € Q?, scalar multiplication 
is defined by a(s, t) = (as, at). Under these operations Q? is a vector space over 
Q (Exercise 2). 


EXAMPLE 3 


The preceding example can be generalized as follows. If F is any field and x = 1 
an integer, let F” = F X FX +++ X F (n summands). Then F" is a vector space 
over F, with addition defined coordinatewise: 


(Sis $2, asta) + (4, faa sta) = (S1 a S PERY Tn a S A Sy + th) 
and scalar multiplication defined by: 
(5), S2 - - «5 Sn) = (aS, aS2, . . . , aSa) acF 


(see Exercise 5). 


*See the preceding footnote. 
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EXAMPLE 4 


The complex numbers C form a vector space over the real numbers R, with 
addition of complex numbers (vectors) defined as usual and with scalar mul- 
tiplication being ordinary multiplication (the product of a real number and a 
complex number is a complex number). 


Special terminology is used in situations like the preceding example. If F and Kare 
fields with F & K, we say that Kis. an extension field of F. For instance, the complex 
numbers C are an extension field of the field R of real numbers. As the preceding 
example shows, the extension field € can be considered as a vector space over R. The 
same thing is true in the general case. 


If K is an extension field of F, then K is a vector space over F, with 
addition of vectors being ordinary addition in K and scalar 
multiplication being ordinary multiplication in K 


(the product of an element the subfield F and an element of K is an element of K). 
For the purposes of this chapter, extension fields are the most important examples of 
vector spaces. 

If V is a vector space over a field F, then the following properties hold for any ve V 
and a E F (Exercise 21): 


0w = 0y a0y= Oy, — —(av) = (~av = a{—v). 


Spanning Sets 
Suppose V is a vector space over a field F and that w and %4, %,...,v, are elements 
of V. We say that w is a linear combination of 2, vz, . . . , % if w can be written in 
the form 

w = ayy + aw + + * + Ayr, 
for some qE F. 


Definition If every element of a vector space V over a field Fis a linear combination of 


Vi, Vor...) Vm We Say that the set {V}, Ve,..., Va} spans V over F. 


EXAMPLE 5 


The set {(1, 0, 0), (0, 1, 0), (0, 0, 1)} spans the vector space Q? over Q because 
every element (a, b, c) of Q? is a linear combination of these three vectors: 


(a, b, c) = a (l, 0, 0) + b (0, 1, 0) + e (0, 0, 1). 


EXAMPLE 6 


Every element of C (considered as a vector space over R) is a linear combina- 
tion of 1 and i because every element can be written in the form al + bi, with 
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a, bER. Thus the set {1, i} spans C over R. The set {1 + i, 5i, 2 + 3i} also 
spans € because any a + bie C is a linear combination of these three elements 
with coefficients in R: 


a+ bi = 3a(l + i) + + (50 + (—a)(2 + 33). 


Linear Independence and Bases 


The set {1, } not only spans the extension field C of R, but it also has this property: If 
al + bi = 0, then a = 0 and b = 0. In other words, when a linear combination of 1 and 
iis 0, then all the coefficients are 0. On the other hand, the set {1 + i, 5i, 2 + 32} does 
not have this property because some linear combinations of these elements are 0 even 
though the coefficients are not; for instance, 


(1 + i) + = (Si ~ 1(2 + 3i) = 0. 


The distinction between these two situations will be crucial in our study of field 
extensions. 


Definition A subset {V}, Və, . .. , Vn} Of a vector space V over a field F is said to be 
linearly independent over F provided that whenever 


Civi + Cove + +++ + Con = Oy 


with each c,EF, then &, = @ for every i. A set that is not linearly indepen- 
dent is said to be linearly dependent. 


Thus, a set {zi}, Up . . - 5 thy} is linearly dependent over F if there exist elements 
bi, ba, . «+5 Op, Of F, at least one of which is nonzero, such that byu; + byu + +++ + betim = Oy. 


EXAMPLE 7 


The remarks preceding the definition show that the subset {1, i} of C is linearly 
independent over R and that the set {1 + i, 5i, 2 + 34} is linearly dependent. 
Note, however, that both of these sets span C. 


EXAMPLE 8 


Consider the subset {(3, 0, 0), (0, 0, 4)} of the vector space Q? over Q and sup- 
pose ¢;, ¢,€Q are such that c,(3, 0, 0) + ¢2(0, 0, 4) = (0, 0, 0). Then 


(0, 0, 0) = ¢(3, 0, 0) + c,(0, 0, 4) = (3e,, 0, 4c), 
which implies that c, = 0 = c3. Hence, {(3, 0, 0), (0, 0, 4)} is linearly indepen- 
dent over Q. However, the set {(3, 0, 0), (0, 0, 4)} does not span Q? because 
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there is no way to write the vector (0, 5, 0), for example, in the form a,(3, 0, 0) 
+ a,(0, 0, 4) = (3a,, 0, 4a,) with ae Q. 


Let V be a vector space over a field F. The preceding examples show that linear 
independence and spanning do not imply each other; a subset of V may have one, 
both, or neither of these properties. A subset that has both properties is given a special 
name. 


Definition A subset {V}, Va ..., Vn} of a vector space V over a field F is said to be a 
basis of V if it spans V and is linearly independent over F. 


EXAMPLE 9 


Example 5 shows that the subset {(1, 0, 0), (0, 1, 0), (0, 0, 1)} spans the vector 
space Q? over Q. This set is also linearly independent over Q (Exercise 8) and, 
hence, is a basis. 


EXAMPLE 10 


Examples 6 and 7 show that the set {1, i} is a basis of C over R. We claim that 
the set {1 + å, 2i} is also a basis of C over R. If ci + ò + c,(2i) = 0, with c, 
ER, then ¢,1 + (c, + 2c,)i = 0. This can happen only if c = 0 and c; + 2¢, = 0. 
But this implies that 2c) = 0 and, hence, c3 = 0. Therefore, {1 + i, 2i} is linearly 
independent. In order to see that {1 + i, 2i} spans C, note that the element 


= “hat 


One situation always leads to linear dependence. Let V bea vector space over a field 
Fand Sa subset of V. Suppose that v, u1, u2,..., u; are some of the elements of S and 
that v is a linear combination of w, uz, ... , Un Say v = aju + *** + aru, with each 
qE F. If w, . . . , w, are the rest of the elements of S, then 


b 
a + bic C can be written as a(1 + i) + ( 


v = ayy +e + au, + Ope, + °° + One, 
and, hence, 
-lw + au ++ + au, + One, + +++ + Open, = Op. 


Since at least one of these coefficients is nonzero (namely —1 p), S is linearly dependent. 
We have proved this useful fact: 


If vc V isa linear combination of w,, iz, . . , , u, E ¥, then any set 
containing v and all the x; is linearly dependent. 


In fact, somewhat more is true. 
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Lemma 11.1 


Let V be a vector space over a field F. The subset {u,, Up, ..., Up} of Vis linearly 
dependent over F if and only if some u, is a linear combination of the preced- 
ing Ones, Uy, Un, «sı Ugie 


Proof if some «x is a linear combination of the preceding ones, then the set 
is linearly dependent by the remarks preceding the lemma. Conversely, 
suppose {2;,..., u,} is linearly dependent. Then there must exist elements 
Cis +. -s Cp EF, not all zero, such that cju, + Crug + ++ +> + Cyu, = Op. Let k 
be the largest index such that c, is nonzero. Then c, = 0, for i> k and 


Cju + can + O a Cpe = Oy 


Clg = —CyUy — Cog T * °° — Ckik- 


Since Fis a field and ¢, # 0, cy! exists; multiplying the preceding equa- 
tion by œ" shows that u; is a linear combination of the preceding u3: 


m= (—cre hu + (coc, un = cl (—ce—164 eet. a 


The next lemma gives an upper limit on the size of a linearly independent set. It 
says, in effect, that if V can be spanned by n elements over F, then every linearly inde- 
pendent subset of V contains at most n elements. 


Lemma 11.2 


Let V be a vector space over the field F that is spanned by the set 
{Vas Vor. + Vp} IF (Uy, Uas . «+, Up} is any linearly independent subset of V, then 
msn. 


Proof» By the definition of spanning, every element of V (in particular 2) is a 
linear combination of uy, . . . , v,. So the set fu, v1, %, ..., Up? is linearly 
dependent. Therefore, one of its elements is a linear combination of the 
preceding ones by Lemma 11.1, say v; = ayu + byny + +++ + bitir 
If v; is deleted, then the remaining set 


(+) {us Ujs e aas Ups Uth eres Up? 


still spans V since every element of V is a linear combination of the v’s 
and any appearance of v can be replaced by aju, + bn ++--++ 

bi-1 %_1- In particular, u, is a linear combination of the elements of the 
set (+). Consequently, the set 


fu, Up, Uy » -+ s Ugis Ugpts + +» Unt 


is linearly dependent. By Lemma 11.1 one of its elements is a linear 
combination of the preceding ones. This element can’t be one of the ws 
because this would imply that the u’s were linearly dependent. So some 
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ais a linear combination of w, u, and the v’s that precede it. Deleting x 
produces the set 


fuis Ups Vis o +s Vigs Vets ++ + s Yas Ug + + +s Unte 


This set still spans V since every element of V is a linear combination of 
the v’s and v, v; can be replaced by linear combinations of u;, uz, and the 
other w’s, In particular, w is a linear combination of the elements in this 
new set. We can continue this process, at each stage adding a u, deleting 
a v, and producing a set that spans V. If m > n, we will run out of v’s be- 
fore all the w’s are inserted, resulting in a set of the form {2;, wz, . . . , up? 
that spans V. But this would mean that u,, would be a linear combination 
of wy, ... , Ups contradicting the linear independence of {tt ..., Umg- 
Therefore, m< n. E 


Theorem 11.3 


Let V be a vector space over a field F. Then any two finite bases of V over F 
have the same number of elements. 


Proof» Suppose {u . . . , ùn} and {ur . . - , Vp) are bases of V over F. Then the 
vs span V and the w’s are linearly independent, so m = n by Lemma 11.2. 
Now reverse the roles: The u’s span V and the ¢’s are linearly indepen- 
dent, so n = m by Lemma 11.2 again. Therefore, m = n. E 


According to Theorem 11.3, the number of elements in a basis of V over Fdoes not 
depend on which basis is chosen. So this number is a property of V. 


Definition IfavectorspaceV overa field £ hasafinitebasis,then V is said to be finite 


dimensional over F. The dimension of V over F is the number of elements 
in any basis of V and is denoted [V:F]. If V does not have a finite basis, then 
V is said to be infinite dimensional over F. 


EXAMPLE 11 


The dimension of Q? over Q is 3 because {(1, 0, 0), (0, 1, 0), (0, 0, 1)} is a basis. 
More generally, if Fis a field, then F” is an n-dimensional vector space over F 
(Exercise 27). 


EXAMPLE 12 


[C:R] = 2 since {1, i} is a basis of C over R. On the other hand, the extension 
field R of Q is an infinite-dimensional vector space over Q. The proof of this fact 


is omitted here because it requires some nontrivial facts about the cardinality of 
infinite sets. 
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Applications to Extension Fields 


In the remainder of this section, K is an extension field of a field F. We say that K is 
a finite-dimensional extension of F if K, considered as a vector space over F, is finite 
dimensional over F. 


Remark If [K:F] = 1 and {vu} is a basis, then every element of K is of the form 
cu for some c€ F. In particular, 1p = cu, and, hence, u = ¢ 1 is in F. Thus, K = F. On 
the other hand, if K = F, it is easy to see that {1p} is a basis and, hence, [K:F] = 1. 
Therefore, 


[K:F|=1 ifandonlyif K=F. 


If F, K, and L are fields with FS K S L, then both K and L can be considered as 
vector spaces over F, and L can be considered as a vector space over K. It is reason- 
able to ask how the dimensions [K:F], [Z:K], and [L:F] are related. Here is the answer. 


Theorem 11,4 


Let F, K, and Ł be fields with F = K CL. If [K:F] and [£:K] are finite, then L isa 
finite-dimensional extension of F and [L:F] = [L:K][K:F]. 


Proof» Suppose [K:F] = m and [L:K] = n. Then there is a basis {u,..., 2%} of 
K over F and a basis {v;, . . . , a} of L over K. Each «and v,is nonzero 
by Exercise 19; hence, all the products uœ; are nonzero. The set of all 
products {u| 1 5 i 5 m, 1 5j = n} has exactly mn elements (no two 
of them can be equal because umy = u,v; implies that uw; — uzv, = Ox 
with u; ug E K, contradicting the linear independence of the v’s over K). 
We need to show only that this set of mn elements is a basis of L over F 
because in that case [L:K][K:F] = nm = [L:F]. 

If wis any element of L, then w is a linear combination of the basis 


elements v, . ... , Un, SAY 
(+) w = by, + by, + +++ + byu with each bE K. 
Each 6, Kis a linear combination of the basis elements u4, . . . , tp SO 


there are aye F such that 


By = ayy + agug +++ + anyum 
b, = au, + amin + +++ + Batty, 


b, = apii + agt H+ + + aptin 


Substituting the right side of each of these expressions in (*) shows that 
w is a sum of terms of the form au, with a, E F. Therefore, the set of 
all products ug, spans L over F. 
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To show linear independence, suppose cy E F and 


(++) Dep = cunv + ov + + Canty = Op, 
2J 


By collecting all the terms involving 2, then all those involving v, and 
so on, we can rewrite (++) as 


(Cut + cgus + >= + Cyt) 
+ (ci + egu + ++ + + Cyt v2 
Het (Guu + Catia + 2+ + Cnm) Un = Op. 


The coefficients of the v’s are elements of K, so the linear independence 
of the v’s implies that for each = 1,2,...,2 


Cytti + cajun + +++ + Cyytty, = Op 
Since each cy € F and the w’s are linearly independent over F, we must 


have cy = 0, for all i, 7. This completes the proof of linear independence, 
and the theorem is proved. W 


The following result will be needed for the proof of Theorem 11.15 in Section 11.4. 


Theorem 11.5 


Let K and L be finite dimensional extension fields of F and let f:K — L be an 
isomorphism such that f(c) = c for every ce F. Then [K:F] = (L:F]. 


Proof > Suppose [K:F] = n and {u,..., u,} is a basis of K over F In order to 
prove that [L:F] = n also, we need only show that { f(u), ..., f(us)} is 
a basis of L over F Let v e L; then since f is an isomorphism, v = f(u) 
for some ue K. By the definition of basis, u = qu, + +++ + cpu, with 
each cE F. Hence, v = f(u) = flea + ++ + + cyu) = Refu) + °° + 
Sc, f(u,). But f(c) = c for every i, so that v = eyf(u) +--+ + cfl). 
Therefore, { f (21), . . . , f(t} spans L. To show linear independence, 
suppose that 


Ay f(a) + +++ + daflu) = Or 
with each d,e F. Then since f(d) = d, we have 
S (Gy + +++ + ditty) = f(d Yf) + +» + (GDF (Un) 
= df(u) + +++ + df(up) = Op. 
Since the isomorphism fis injective, du, + + «+ + dytin = Op by Theorem 6.11. 


But the v's are linearly independent in K, and, hence, every d, = 0p Thus 
{fav ..-, fd} is linearly independent and, therefore, a basis. W 
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Field Extensions 


E Exercises 


NOTE: V denotes a vector space over a field F, and K denotes an extension field of F. 


A. 1. 
2. 
3. 


4. 


Show that M(R) is a vector space over R. 

Show that Q? is a vector space over Q. 

Show that the polynomial ring R[x] (with the usual addition of polynomials 
and product of a constant and a polynomial) is a vector space over R. 

If n = 1 is an integer, let R [x] denote the set consisting of the constant 
polynomial 0 and all polynomialsin R[x] of degree = n. Show that R, [x] 
(with the usual addition of polynomials and product of a constant and a 
polynomial) is a vector space over R. 


5. If n= 1 is an integer, show that F” is a vector space over F. 


B. 21. 


. If {u vy, --., Ua} Spans K over F and w is any element of K, show that 
{w, v1, Vz » » +> Un} also spans K. 
. Show that {i, 1 + 2é, 1 + 3i} spans C over R. 


. Show that the subset {(1, 0, 0), (0, 1, 0), (0, 0, 1)} of Q? is linearly independent 


over Q. 


. Show that { V2, V2 + i,V3 — i} is linearly dependent over R. 
. If vis anonzero element of V, prove that {v} is linearly independent over F. 
. Prove that any subset of V that contains 0, is linearly dependent over F. 


. If the subset {u, v, w} of Vis linearly independent over F, prove that 


{u, u + u, u +v + w} is linearly independent. 


. If S= {u,..., %} isa linearly dependent subset of V, then prove that any 


subset of V that contains S is also linearly dependent over F. 


. If the subset T ={w,..., u} of Vis linearly independent over F, then prove 


that any nonempty subset of T is also linearly independent. 


. Let b and d be distinct nonzero real numbers and c any real number. Prove that 


{b, e + di} is a basis of C over R. 


. If Kis an n-dimensional extension field of Z,, what is the maxinmm possible 


number of elements in K? 


. Let {x,..., Un} be a basis of V over F and let cy, .. . , %, be nonzero elements 


of F. Prove that {¢,2, C72, . «+, €,U,} is also a basis of V over F. 


. Show that {1, [x]} is a basis of Z2[x]/(x? + x + 1) over Zp. 
. If {vi 02... , Ya? is a basis of v, prove that v; # Oy for every i. 
. Let F, K, and L be fields such that Fo KC L. If $ = {y, x, ..., vy} spans L 


over F, explain why S also spans L over K. 

For any vector v E V and any element a E F, prove that 
(a) 0, = Oy. [Hint: Adapt the proof of Theorem 3.5.] 
(b) a0, = Oy. 

© -(av) = (a) v = a-2). 
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22. (a) Prove that the subset {1, V23} of R is linearly independent over Q. 


(b) Prove that V3 is not a linear combination of 1 and V2 with coefficients in 
Q. Conclude that {1, V2} does not span R over Q. 


23. (a) Show that {1, V2, V3} is linearly independent over Q. 
(b) Show that {1, V2, V3, v6 is linearly independent over Q. 


24. Letv be a nonzero real number. Prove that {1, w} is linearly independent over 
Q if and only if v is irrational. 


25. (a) Letk = 1 be an integer. Show that the subset 1, x, x7, x°,...,*} of Rix] 
is linearly independent over R (see Exercise 3). 


(b) Show that R[x] is infinite dimensional over R. 
26. Show that the vector space R,,[x] of Exercise 4 has dimension n + 1 over R. 
27. If Fis a field, show that the vector space F” has dimension n over F. 
28. Prove that K has exactly one basis over F if and only if K = F = 2. 


29. Assume lp + Ip + Or. If {u, v, w} isa basis of V over F, prove that the set 
{u + v, v + w, u + w} is also a basis. 


30. Prove that {v;, . . . , va} is a basis of V over F if and only if every element of V 
can be written in a unique way as a linear combination of v4, . . . , v, (“unique” 
means that if w = cyu, +--+ + Cun and w = divi + ++ + + d,v,, then c = d, 
for every i). 

31. Let p(x) = œ + ax +++: + + a,x" be irreducible in F[x] and let L be the 
extension field F[x]/(p(x)) of F. Prove that L has dimension n over F. 

[Hint: Corollary 5.5, Theorems 5.8 and 5.10, and Exercise 30 may be helpful.] 


32. If S= {2,...,v,} spans F over F, prove that some subset of S is a basis of K 
over F. [Hint: Use Lemma 11.1 repeatedly to eliminate v’s until you reduce toa 
set that still spans V and is linearly independent.] 


33. If the subset {),..., u,} of Vis linearly independent over Fand we Vis nota 
linear combination of the u's, prove that {2;,... , u,, w} is linearly independent. 


34. If Vis infinite-dimensional over F, then prove that for any positive integer k, 
V contains a set of k vectors that is linearly independent over F. [Hint: Use 
induction; Exercise 10 is the case k = 1, and Exercise 33 can be used to prove 
the inductive step] 


35. Assume that the subset {v;, . . . , Va} of Vis linearly independent over F and that 
w = cit + +++ + Gtp With c, EF. Prove that the set {w —w, w — v,..., w— Uy} 
is linearly independent over Fif and only if a+ ++: +o # Ip 


36. Assume that V is finite-dimensional over F and S is a linearly independent 
subset of V. Prove that S is contained in a basis of V. [Hint: Let [V:F] =n 
and S = {uj ..., t}; then m =< n by Lemma 11.2. If S does not span V, 
then there must be some w that is not a linear combination of the u’s. Apply 
Exercise 33 to obtain a larger independent set; if it doesn’t span, repeat the 
argument. Use Lemma 11.2 to show that the process must end with a basis that 
contains S.] 
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37. Assume that [V:F] = n and prove that the following conditions are equivalent: 
(i) {y,..., Yn} spans V over F., 
(ii) {x,...,u,} is linearly independent over F. 
(iii) {v,,..., Yp} is a basis of V over F. 


38. Let F, K, and L be fields such that FS K&L. If [L:F] is finite, then prove that 
[L:K] and [K:F] are also finite and both are = [L:F]. [Hint: Use Exercises 20 
and 32 to show that [L:K] is finite. To show that [K:F] is finite, suppose 
[L:F] = n. The set {1x} is linearly independent by Exercise 10; if it doesn’t 
span K, proceed as in the hint to Exercise 36 to build larger and larger linearly 
independent subsets of K. Use Lemma 11.2 and the fact that [L:F] = n 
to show that the process must end with a basis of K containing at most n 
elements.] 


39. If [K:F] = p, with p prime, prove that there is no field E such that F ÇEÇK. 
[Hint: Exercise 38 and Theorem 11.4.] 


11.2 Simple Extensions 


Field extensions can be considered from two points of view. You can look upward from 
a field to its extensions or downward to its subfields. Chapter 5 provided an example 
of the upward point of view. We took a field F and an irreducible polynomial p(x) in 
F[x] and formed the field of congruence classes (that is, the quotient field) F[x]/(p(x)). 
Theorem 5.11 shows that F[x]/(p(x)) is an extension field of F that contains a root 
of p(x). 

In this section we take the downward view, starting with a field K and a subfield F. 
If u € K, what can be said about the subfields of K that contain both u and F? Is there 
a smallest such subfield? If u is the root of some irreducible p(x) in F[x], how is this 
smallest subfield related to the extension field F[x]/(p(x)), which also contains a root 
of p(x)? 

The theoretical answer to the first two questions is quite easy. Let K be an extension 
field of F and u E K. Let F(u) denote the intersection of all subfields of K that contain 
both F and u (this family of subfields is nonempty since K at least is in it). Since the 
intersection of any family of subfields of K is itself a field (Exercise 1), F(u) is a field. 
By its definition, F(u) is contained in every subfield of K that contains F and u, and, 
hence, F{u) is the smallest subfield of K containing F and u. F(u) is said to be a simple 
extension of F. 

As a practical matter, this answer is not entirely satisfactory. A more explicit 
description of the simple extension field F(u) is needed. It turns out that the structure 
of F(u) depends on whether or not u is the root of some polynomial in F[x]. So we 
pause to introduce some terminology. 


Definition Anelement u of an extension field K of Fis said to be algebraic over F if u is 
the root of some nonzero polynomial in F[x]. An element of K that is not the 


root of any nonzero polynomial in F[x] is said to be transcendental over £, 
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EXAMPLE 1 


In the extension field C of R, i is algebraic over R because / is the root of x7 + 1€ 
Rix]. You can easily verify that element 2 + i of Cis a root of x — x? — 7x + 15€ 


Q[x]. Thus 2 + i is algebraic over Q. Similarly, W3 is algebraic over Q since it 
is a root of xf — 3. 


EXAMPLE 2 


Every element c in a field F is algebraic over F because c is the root of x — e e F[x]. 


EXAMPLE 3 


The real numbers 7 and e are transcendental over Q (proof omitted). Hereafter 
we shall concentrate on algebraic elements. For more information on transcen- 
dental elements, see Exercises 10 and 24-26. 


If u is an algebraic element of an extension field K of F, then there may be many 
polynomials in F[x] that have u as a root. The next theorem shows that all of them 
are multiples of a single polynomial; this polynomial will enable us to give a precise 
description of the simple extension field F(u). 


Theorem 11.6 


Let K be an extension field of F and u € K an algebraic element over F. Then 
there exists a unique monic irreducible polynomial p(x) in F[x] that has uasa 
root. Furthermore, if y is a root of g(x} efix], then p(x) divides g(x). 


Proof > Let S be the set of all nonzero polynomials in F[x] that have u as a root. 
Then S is nonempty because u is algebraic over F. The degrees of poly- 
nomials in S form a nonempty set of nonnegative integers, which must 
contain a smallest element by the Well-Ordering Axiom. Let p(x) be a 
polynomial of smallest degree in S. Every nonzero constant multiple 
of p(x) is a polynomial of the same degree with u as a root. So we can 
choose p(x) to be monic (if it isn’t, multiply by the inverse of its leading 
coefficient). 

If p(x) were not irreducible in F[x], there would be polynomials k(x) 
and #(x) such that p(x) = k(x)t(x), with deg k(x) < deg p(x) and deg t(x) < 
deg p(x). Consequently, k(u)t(u) = p(u) = Or in K. Since K is a field either 
k(x) = Opor t(u) = Op thatis, either k(x) or (x) is in S. This is impossible 
since p(x) is a polynomial of smallest degree in S. Hence, p(x) is irreducible. 

Next we show that p(x) divides every g(x) in S. By the Division 
Algorithm, g(x) = p(x)g¢(x) + r(x), where r(x) = Opor deg r(x) < deg p(x). 
Since v is a root of both g(x) and p(x), 


r(u) = g(u) — plugu) = Op + Org(u) = Op 
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So u is a root of r(x). If r(x) were nonzero, then r(x) would be in S, 
contradicting the fact that p(x) is a polynomial of smallest degree in S. 
Therefore, r(x) = 0p so that g(x) = p(x)g(x). Hence, p(x) divides every 
polynomial in S. 

To show that p(x) is unique, suppose t(x) is a monic irreducible 
polynomial in S. Then p(x) | t(x). Since p(x) is irreducible (and, hence, 
nonconstant) and #(x) is irreducible, we must have t(x) = cp(x) for some 
cE F. But p(x) is monic, so c is the leading coefficient of cp(x) and, 
hence, of t(x). Since t(x) is monic, we must have ¢ = 1p. Therefore, p(x) = 
t(x) and p(x) is unique. W 

If Kis an extension field of F and uE X is algebraic over F, then the monic, irre- 
ducible polynomial p(x) in Theorem 11.6 is called the minimal polynomial of u over F. 
The uniqueness statement in Theorem 11.6 means that once we have found any monic, 
irreducible polynomial in F[x] that has u as a root, it must be the minimal polynomial 
of u over F. 


EXAMPLE 4 


xX — 3 isa monic, irreducible polynomial in Q[-x] that has V3 ER as a root. 
Therefore, x? — 3 isthe minimal polynomial of V3 over Q. Note that x? — 3 is 
reducible over R since it factors as (x —V3)(x + V3) in R[x]. So the minimal 
polynomial of ‘V3 over R is x — V3, which is monic and irreducible in R[x]. 


EXAMPLE 5 


Let u = V3 + V5ER. Then u? = 3 + 2V3V5 + 5 = 8 + 2VI5. Hence, 

w? — 8 = 2V15 so that (x? — 8)? = 60, or, equivalently, (u? — 8)* — 60 = 0. 
Therefore, u = V3 + V5 is a root of (x?— 8)? — 60 = x4 — 16x? + 4E Q[x]. 
Verify that this polynomial is irreducible in Q[x] (Exercise 14). Hence, it must 
be the minimal polynomial of V3 + V5 over Q. 


The minimal polynomial of u provides the connection between the upward and 
downward views of simple field extensions and allows us to give a useful description 


of F(u). 


Theorem 11.7 


Let K be an extension field of F and ue K an algebraic element over F with 
minimal polynomial p(x) of degree n. Then 


(1) Flu) = FA). 
(2) {1a u, matt a is a basis of the vector space F(u) over F. 
(3) [F(u): F] =a. 
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Theorem 1 1.7 shows that when v is algebraic over F, then Fu) does not depend on K 
but is completely determined by F[x] and the minimal polynomial p(x). Consequently, 
we sometimes say that F(u) is the field obtained by adjoining z to F. 


Proof of Theorem 11.7 > (1) Since F(z) is a field containing u, it must contain 
every positive power of u. Since F(u) also contains F, F(z) must 
contain every element of the form by + bju + ba? ++ ++ + bu 
with ,¢ F, that is, (u) contains the element f(u) for every f(x) € Fix]. 
Verify that the map g:F [x] > F(u) given by 9( /(x)) = f(u) is a 
homomorphism of rings. A polynomial in F[x] is in the kemel of ¢ 
precisely when it has u as a root. By Theorem 11.6 the kernel of o 
is the principal ideal (p(x)). The First Isomorphism Theorem 6.13 
shows that F[x]/(p(x)) is isomorphic to Im ¢ under the map that sends 
congruence class (coset) [f(x)] to f(u). Furthermore, since p(x) is 
irreducible, the quotient ring F[x]/(p(x)), and, hence, Im 9, are fields 
by Theorem 5.10. Every constant polynomial is mapped to itself by œ 
and ọ(x) = u. So Im ¢ isa subfield of F(z) that contains both F and 
u. Since F(u) is the smallest subfield of K containing F and u, we must 


have F(u) = Im g = F[x]/(p(x)). 


(2) and (3) Since F(u) = Im 9, every nonzero element of F(z) is 
of the form f(u) for some f(x) € F[x]. If deg p(x) = n, then by the 
Division Algorithm f(x) = p(x)¢(x) + r(x), where r(x) = bg + bix + 

+B, pte F [x]. ie A Stu) = = = p(u)a(u) + r(u) = Opq(u) + 
r(u) = r(u) = bolz + biu + +*+ +b, 12"). Therefore, the set 
{lr u, u?,..., u")} spans Flu). To show that this set is linearly 
independent, suppose c + cju + - «+ + ep pu”! = Op with each 
cE F. Then u is a root of cy + cix +--+ + ¢,_,x"~', so this poly- 
nomial (which has degree = n—1) must be divisible by p(x) (which 
has degree n). This can happen only when cg + cix ++ °° + Cpi: 
is the zero polynomial; that is, each ¢, = 0p Thus {1p, u, u2,..., 21} 
is linearly independent over F and, therefore, a basis of F(u). 

Hence, [F(u): F] =”. m 


EXAMPLE 6 


The minimal polynomial of V3 over Q is x? — 3. Applying Theorem 11.7 with n = 
we see that {1, V3} is a basis of Q( V3) over Q, whence [Av3): Qj=2. a 
Example 5 shows that V3 + V5 has minimal polynomial x* — 16x” + 4 over Q so 
that [Q(V3 + V5): Q] =4 and {1, V3 + V5, (V3 + V5}, (V3 + V5)} isa 
basis. 


An immediate consequence of Theorem 11.7 is that 
if u and v have the same minimal polynomial p(x) 


in F[x), then F(z) is isomorphic to F(w). 
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The reason is that both F(u) and F(v) are isomorphic to F[x]/(A{x)) and, hence, to 
each other. Note that this result holds even when u and v are not in the same extension 
field of F. The remainder of this section, which is not needed until Section 11.4, deals 
with generalizations of this idea. We shall consider not only simple extensions of the 
same field, but also simple extensions of two different, but isomorphic, fields. 

Suppose F and E are fields and that g:F — E is an isomorphism. Verify that the 
map from F[x] to E[x] that maps f(x) = a) + a,x + a,x? + » - - + a,x" to the polyno- 
mial of (x) = o(ao) + o(a,)x + (ax? + «++ + o(@,)* is an isomorphism of rings 
(Exercise 21 in Section 4.1). Note that if f(x) = c is a constant polynomial in F[x] 
(that is, an element of F), then this isomorphism maps it onto g(c) € E. Consequently, 
we say that the isomorphism F[x] — E[x] extends the isomorphism o:F — E, and we 
denote the extended isomorphism by g as well. 


Corollary 11.8 


Let o:F — E be an isomorphism of fields. Let u be an algebraic element in 
some extension field of F with minimal polynomial p(x) EF[x]}. Let v be an 
algebraic element in some extension field of E, with minimal polynomial 
op(x) € Eix]. Then g extends to an isomorphism of fields m:F(u) — E(v) such 
that a(u) = v and o(c) = o(c) for every c EF. 


The special case when g is the identity map F — F states whenever u and v have 
the same minimal polynomial, then Hu) = Hv) under a function that maps u to v and 
every element of F to itself. 


Proof of Corollary IL.8> The isomorphism g extends to an isomorphism (also 
denoted g) F [x] —> E[x] by the remarks preceding the corollary. The proof 
of Theorem 11.7 shows that there is an isomorphism T:E[x]/(ap(x)) => E(v) 
given by 7((2(x)]) = gv). Let 7 be the surjective homomorphism 


E[x] > Eix] op) 
that maps g(x) to [g(x)] and consider the composition 
Fix] — Efe] > EIo) Elo) 
fx) —— of (x) —— [of] —— of). 


Since all three maps are surjective, so is the composite function. The 
kernel of the composite function consists of all A(x) € F[x] such that 
oh(v) = Og. Since T is an isomorphism, gh(v) = Og if and only if [r A(x) 
is the zero class in E[x]/(ap(x)), that is, if and only if h(x) is a mul- 
tiple of ap(x). But if h(x) = k(x) - ap(x), then applying the inverse of 
the isomorphism o shows that A(x) = a! (k(x))p(x). Thus the kernel of 
the composite function is the principal ideal (p(x)) in F[x]. Therefore, 
F[x]/(o(x)) = E(v) by the First Isomorphism Theorem 6.13; the proof 


Ce 00 Canara Lessing A Daa aai Say x be pled wane = or Gapir aat, in whats or ka part Dus to elsctrunic cights, mma third pasty contest may be exppramend firun the aBlock eaivor a hegnar(s). Ediarial seview has 
that eny appr costed does ict ostaniefly «ffect the oven (earning expeiare Cangege Learning carve the right orana ef@tionel cuinat at ny tune if suhenquen) rights ostrictim» require it. 


11.2 Simple Extensions 381 


of that theorem shows that this isomorphism (call it 6) is given by 
D = afv). Note that @([x]) = v and that for each ce € F, 0([c]) = 
a(c). So we have the following situation, where ¢ is the isomorphism of 
Theorem 11.7: 


Flu] Fixe) —> Eo) 
fld——— [we] —— ofo) 
ce —— [d ——_— a (0) cEF. 


The composite function 0 ° p`! : Ru) > E(w) is an isomorphism that ex- 
tends g and maps u to v. E 


EXAMPLE 7 


The polynomial x? — 2 is irreducible in Q[x] by Eisenstein’s Criterion. It has a 
root in R, namely 2. Verify that ¥/2w is also a root of x3 — 2 in C, where 
-1+ 
- 5 V3i 5 — is a complex cube root of 1. Applying Corollary 11.8 to the 
identity map Q —> Q we see that the real subfield Q(¥/2) is isomorphic to 
the complex subfield Q(¥/2w) under a map that sends ¥/2 to W/2w and each 
element of Q to itself. 


E Exercises 


NOTE: Unless stated otherwise, K is an extension field of the field F. 


A. 1. Let {Ze I} be a family of subfields of K. Prove that Oe is a subfield of K. 


. If u EK, prove that Fu’) S Fu). 
. If we Kand cE F, prove that F(u + c) = Fu) = Feu). 
. Prove that Q(3 + i) = Q(1 — ù. 
. Prove that the given element is algebraic over Q: 
(a)3+5 (b) VI-V2 @1+%2 
6. If we K and x? is algebraic over F, prove that u is algebraic over F 


7. If Lisa field such that FO K& L and uE Lis algebraic over F, show that w is 
algebraic over K. 


nN a U Nm 


8. If u, vE K and u + vis algebraic over F, prove that u is algebraic over F(z). 
9. Prove that Vr is algebraic over Q(x). 


10. If u € Kis transcendental over F and 07 # cE F, prove that each of u + 1p cu, 
and u? is transcendental over F. 


11. Find [Q(Y2): Q]. 
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12. If a + bie C and b + 0, prove that C = R(a + Bi). 


13. If [K:F] is prime and u E K is algebraic over F, show that either F(u) = Kor 
Flu) = F. 
14. Prove that x* — 16x? + 4 is irreducible in Q[x]. 
B. 15. Show that every element of C is algebraic over R [Hint: See Lemma 4.29.] 


16. If u EK is algebraic over F and ¢ E F, prove that u + 1,and cu are algebraic 
over F. 


17. Find the minimal polynomial of the given element over Q: 
(a) V1+V5 (b) V3i+ V2 
18. Find the minimal polynomial of V2 + i over Q and over R. 


19. Let u be an algebraic element of K whose minimal polynomial in F [x] has prime 
degree. If E is a field such that FS E & F(z), show that E = For E = Ku). 


20. Let u be an algebraic element of K whose minimal polynomial in F[x] has odd 
degree. Prove that F(u) = Flu”). 


21. Let F = Q(z‘) and K = Q(z). Show that 7 is algebraic over Fand find a basis 


of K over F. 
22. If rand s are nonzero, prove that Q( Vr) = Q( V5) if and only if r = £s for 
some tE Q. 


23. If K is an extension field of Q such that [K:Q] = 2, prove that K = Q (Vd) for 
some square-free integer d. [Square-free means dis not divisible by p° for any 
prime p.] 

24. If u EK is transcendental over F, prove that F{u) = F(x), where F(x) is the 


field of quotients of F[x], as in Example 1 of Section 10.4. [Hint: Consider the 
map from F(x) to F(u) that sends f(x)/g(x) to f(e)g(%4)7}.] 


25. If u E€ Kis transcendental over F, prove that all elements of F(u), except those 
in F, are transcendental over F. 


26. Let F(x) be as in Exercise 24. Show that 
over F. 


113 Algebraic Extensions 


The emphasis in the last section was on a single algebraic element. Now we consider 
extensions that consist entirely of algebraic elements. 


2a i € F(x) is transcendental 


Definition An extension field K of a field F is said to be an algebraic extension of F if 


every element of K is algebraic over F, 
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EXAMPLE 1 


If a + biE C, then a + biis a root of 
(x — (a + bi)\(x — (a — bi) = x? — 2ax + (2 + b5 eRfx]. 


Therefore, a + bi is algebraic over R, and, hence, C is an algebraic extension 
of R. On the other hand, neither C nor R is an algebraic extension of Q since 
there are real numbers (such as ~ and e) that are not algebraic over Q. 


Every algebraic element « over F lies in some finite-dimensional extension field 
of F, namely F(z), by Theorem 11.7. On the other hand, if we begin with a finite- 
dimensional extension of F we have 


Theorem 11.9 


If K is a finite-dimensional extension field of F, then K is an algebraic exten- 
sion of F. 


Proof» By hypothesis, K has a finite basis over F, say {v}, vo, ..., Up}. Since 
these z elements span K, Lemma 11.2 implies that every linearly inde- 
pendent set in K must have n or fewer elements. 

If u E€ K, there are two possibilities: (1) «’ = u/ with 0 < i < j; and 
(2) all nonnegative powers of u are distinct. In Case (1), u is a root of 
the polynomial x! — x/ € F[x] and hence, is algebraic over F. In Case (2), 
{1p, u, u2,..., u”} is a set of n + 1 elements in K and must, therefore, be 
linearly dependent over F. Consequently, there are elements ¢, in F, not 
all zero, such that colp + cju + czu? +- - + cpu” = Op Therefore, u is 
the root of the nonzero polynomial co + cix + cx? +--+ cp in F[x] 
and, hence, algebraic over F. W 


If an extension field K of F contains a transcendental element u, then K must 
be infinite dimensional over F (otherwise u would be algebraic by Theorem 11.9). 
Nevertheless, the converse of Theorem 11.9 is false since there do exist infinite- 
dimensional algebraicextensions (Exercise 16). 

Simple extensions have a nice property. You need only verify that the single ele- 
ment u is algebraic over F to conclude that the entire field F(u) is an algebraic 
extension (because F(z) is finite dimensional by Theorem 11.7 and, hence, algebraic 
by Theorem 11.9). This suggests that generalizing the notion of simple extension 
might lead to fields whose algebraicity could be determined by checking just a finite 
number of elements. 

If ui, . . . , u, are elements of an extension field K of F, let 


Flu, U2, -o 5 Up) 


denote the intersection of all the subfields of K that contain F and every u, As in the case 
of simple extensions, F(t), . . . , 2,,) is the smallest subfield of K that contains Fand all the 
ty. Flug, .. «5 Up) is said to be a finitely generated extension of F, generated by u4, . . . , tt. 
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EXAMPLE 2 


The field a(v3, i) is the smallest subfield of C that contains both the field Q 
and the elements V3 and i. 


EXAMPLE 3 


A finitely generated extension may actually be a simple extension. For instance, 
the field Q(i) contains both i and — i so Q(i, — i) = Q(Ñ. 


EXAMPLE 4 


Every finite-dimensional extension is also finitely generated. If {u,,..., u,] is 
a basis of K over F, then all linear combinations of the u (coefficients in F) are 
in Fu, ..., tp). Therefore, K = F(uj,... , Un): 


The key to dealing with finitely generated extensions is to note that they can be 
obtained by taking successive simple extensions. For instance, if K is an extension 
field of F and u, vE K, then F(u, v) is a subfield of K that contains both F and u 
and, hence, must contain F(u). Since v is in F(u, v), this latter field must contain 
F(u)(v), the smallest subfield containing both F(u) and v. But F(u)(v) is a field 
containing F, u, and v and, hence, must contain F(z, v). Therefore, Ru, v) = F(u)(v). 
Thus the finitely generated extension F(u, v) can be obtained from a chain of simple 
extensions: 


Fs Flu) S F(u)(v) = F(u, v). 
EXAMPLE 5 


The extension field Q(Vv3, i) can be obtained by this sequence of simple 
extensions: 


Qs Q(V3) s Q(V3Xi) = Q(V3, i). 


As we saw in Example 4 of Section 11.2, x* — 3 is the minimal polynomial 
of V3 over Q, so that [Q( V3): Q] = 2 by Theorem 11.7. Similarly, x? + 1 
[whose coefficients are in Q(v3) is the minimal polynomial of i over Q(V3) 
because its roots ti are not in am so xX? + lis irreducible over Q(v3) by 


Corollary 4.19. By Theorem 11.7 again, [a(v3)O:Q( v3)] = 2. Consequently, 
by Theorem 11.4, 


[Q(-V3, i):Q] = [Q(VIO:A(VIA(v3):Q] = 2 -2 = 4. 


Thus, the finitely generated extension Q( V3, i) is finite dimensional and, hence, 
algebraic over Q by Theorem 11.9. 
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Essentially the same argument works in the general case and provides a useful way 
to determine that an extension is algebraic: 


Theorem 11.10 


If K = Hur . .. , Up) is a finitely generated extension field of F and each uy, is 
algebraic over F, then K is a finite-dimensional algebraic extension of F., 


Proof» The field K can be obtained from this chain of extensions: 
Fo Fu) © Hu, w) & Hu, Ur ty) S aie 
Fu, .. - s Un) S Fu, sees Uy) = 


Furthermore, F(u, u) = Flu;)(u2), Flu, uo, u) = F(t, 2) (u), and in 
general F(u;,..., u) is the simple extension F(u;,..., u,i)(u,. Each w 
is algebraic over F and, hence, algebraic over F(u, . . . , u1) by Exercise 
7 of Section 11.2. But every simple extension by an diabetic element is 
finitedimensional by Theorem 11.7. Therefore, 


[Fluy,..., u):Flu, ... p i) 


is finite for each i = 2, . . . , n. Consequently, by repeated application of 
Theorem 11.4, we see that [K:F ] is the product 


[K:Fluy, - = 5 tad) > > + [Fe ty, 1): Fling, Fun U2):F(u)P(e):F 1. 
Thus [K:F] is finite, and, hence, Kis algebraic over F by Theorem 11.9. E 


EXAMPLE 6 


Both V3 and V5 are algebraic over Q, so Q( V3, v5) is a finite-dimensional 
algebraic extension field of Q by Theorem 11.10. We can calculate the dimen- 
sion of Q( V3, V5) over Q by considering this chain of simple extensions: 


ac (V3) (V3)(V5) = O(V3, V5). 


We know that [Q(V3):Q] = 2. To determine [Q(-V3)(‘V5):Q(V3)] we shall 
find the minimal polynomial of V5 over Q(V3).The obvious candidate is 
X? — 5; it is irreducible in Q[x], but we must show that itis irreducible over 
Q(v3), in order to conclude that it is the minimal polynomial. If V5 or —V5 
is in Q( V3), then +V5 = æ + bV3, with a, b E Q. Squaring both sides shows 


that 5 = a’ + 2abV/3 + 347, whence V3 = as a 


the fact that V3 is irrational; a similar contradiction results if a = 0 or b = 0. 
Therefore, + V5 are not in Q( V3), and, hence, x? — 5 is irreducible over Q(V3) 
by Corollary 4.19. So x? — 5 is the minimal polynomial of V5 over Q('V3), and 
[Q(-V3)(V5) : Q('V3)] = 2 by Theorem 11.7. Consequently, by Theorem 11.4 


[Q(v3, V5):Q] = [A(V3X V5} VIJA(V3)}0] = 2-2 = 4. 


, contradicting 
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The remainder of this section is not used in the sequel. Theorem 11.4 tells us that 
the top field in a chain of finite-dimensional extensions is finite dimensional over the 
ground field. Here is an analogous result for algebraic extensions that may not be finite 
dimensional. 


Corollary 11.11 


fZ is an algebraic extension field of K and K is an algebraic extension field of 
F, then L is an algebraic extension of F. 


Proof» Let u € L. Since u is algebraic over K, there exist a, € K such that 
a + aju + ag? +--+ + apu” = 0x Since each of the ais in the field 
F(a, ..., Gm), u is actually algebraic over F(a), . . . , am). Consequently, 
in the extension chain 


FS Fa... , âm) S Flay, . +5 Apu) = Flay, «s am U) 


F(a, . . . , 4)(u) is finite dimensional over F(a,, . . . , @,) by Theorem 11.7. 
Furthermore, [F(a), . . « 5 &n):F] is finite by Theorem 11.10 since each a, is 
algebraic over F. Therefore, F(a, . . . , am, u) is finite dimensional over F 
by Theorem 11.4 and, hence, is algebraic over F by Theorem 11.9. Thus 
u is algebraic over F. Since u was an arbitrary element of L, L is an alge- 
braic extension of F. E 


Corollary 11.12 


Let K be an extension fieid of F and let £ be the set of all elements of K that 
are algebraic over F. Then £ is a subfield of K and an algebraic extension field 
of F. 


Proof» Every element of F is algebraic over F, so FS E. If u, v E E, then u and v 
are algebraic over F by definition. The subfield F (u, v) is an algebraic ex- 
tension of F by Theorem 11.10, and, hence, F{u, v) & E. Since F(u, v) is 
a field, u + v, uv, —u, —v E Flu, v) S E. Similarly, if u is nonzero, then 
u`! e Flu, v) S E. Therefore, E is closed under addition and multiplica- 
tion; negatives and inverses of elements of E are also in E. Hence, E is a 
field. m 


EXAMPLE 7 


If K = C and F = Q in Corollary 11.12, then the field E is called the field of 
algebraic numbers. The field Æ is an infinite-dimensional algebraic extension 
of Q (Exercise 16). Algebraic numbers were discussed in a somewhat different 
context on page 350. 
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E Exercises 


NOTE: Unless stated otherwise, K is an extension field of the field F. 
A. 1. If u, ve K, verify that F(u)(v) = F(v)(2). 
2. If Kisa finite field, show that K is an algebraic extension of F. 
3. Find a basis of the given extension field of Q. 
(a) AV5,i) (b) AVS, V) ©) Av2, V3, V5) @ Q(-V2, V3) 
. Find a basis of Q( v2, + v3) over Q(v3). 
. Show that [Q( V3, i):Q] = 4. 
. Verify that [Q(V2, V5, V10):Q] = 4. 
. If [K:F] is finite and u is algebraic over K, prove that [K(u):K] = [F(u):F]. 


. If [K:F] is finite and u is algebraic over K, prove that [K(u):F(z)] = [K:F]. 
[Hint: Show that any basis of K over F spans K(z) over F(z).] 


9. If [K:F] is finite and wu is algebraic over K, prove that [F(u):F'] divides [K(z):F]. 


B.10. Prove that [K:F] is finite if and only if K = F(u,,..., tip), with each u; 
algebraic over F. [This is a stronger version of Theorem 11.10.] 


9 Nu A wm db 


11. Assume that u, v E K are algebraic over F, with minimal polynomials p(x) and 
q(x), respectively. 
(a) If deg p(x) = m and deg g(x) = n and (m, n) = 1, prove that [Ru, v):F] = mn. 
(b) Show by example that the conclusion of part (a) may be false if and n 
are not relatively prime. 


(c) What is [Q( V2, V/2):Q]? 

12. Let D bea ring such that FS DS K. If K is algebraic over F, prove that D isa 
field. [Hint: To find the inverse of a nonzero u E D, use Theorem 11.7 to show 
that F(z) & D.] 

13. Let p(x) and g(x) be irreducible in F[>] and assume that deg p(x) is relatively 
prime to deg g(x). Let u be a root of p(x) and v a root of g(x) in some 
extension field of F. Prove that q(x) is irreducible over Fu). 

14. (a) Let Fj © Fe FE. * bea chain of fields. Prove that the union of all the F, 

is also a field. 
(b) If each F; is algebraic over F}, show that the union of the F; is an algebraic 
extension of Fy. 


15. Let Ebe the field of all elements of K that are algebraic over F, as in Corol- 
lary 11.12. Prove that every element of the set K — E is transcendental over E. 


16. Let E be the field of algebraic numbers (see Example 7). Prove that E is an 
infinite dimensional algebraic extension of Q. [Hint: It suffices to show that 
[E:Q] = n for every positive integer n. Consider roots of the polynomial 
x” — 2 and Eisenstein’s Criterion.] 
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17. Assume that 1p +1, # Op If u E F, let Vu denote a root of x7 — u 
in K. Prove that F(Vu + Vv) = F(Vu, Vv). [Hint 1, (Vu + Vo), 
(Vu + Vo Ý, (Vu + Vu }, etc., must span F{ Vu + Vu) by Theorem 11.7. 
Use this to show that Vu and Vvare in F(Vu + Vv).] 


18. If m,...,”, are distinct positive integers, show that 
[Q(-Vay,..., Vn): Q) = 2". 


C. 19. If each n, is prime in Exercise 18, show that = may be replaced by =. 


m4 Splitting Fields 


Let F be a field and f(x) a polynomial in F[x]. Previously we considered extension 
fields of F that contained a root of f(x). Now we investigate extension fields that 
contain #// the roots of f(x). 

The word “all” in this context needs some clarification. Suppose f(x) has degree n. 
Then by Corollary 4.17, f(x) has at most n roots in any field. So if an extension field 
K of F contains n distinct roots of f(x), one can reasonably say that K contains “all” 
the roots of f(x), even though there may be another extension of F that also contains 
n roots of f(x). On the other hand, suppose that K contains fewer than n roots of f(x). 
It might be possible to find an extension field of K that contains additional roots of 
f(x). But if no such extension of K exists, it is reasonable to say that K contains “all” 
the roots. We can express this condition in a usable form as follows. 

Let K be an extension field of F and f(x) a nonconstant polynomial of degree n in 
Fix]. If f(x) factors in K[x] as 

SE) = efx — u(x — u) + > + — uy) 

then we say that f(x) splits over the field K. In this case, the (not necessarily distinct) 
elements wu), . . . , u, are the only roots of f(x) in K or in any extension field of K. For 
if v is in some extension of K and f(v) = 0, then c(v ~ u))(v — u7) +++ w — uy) = Op 
Now c is nonzero since f(x) is nonconstant. Hence one of the v — u; must be zero, that 
is, v = u. So if f(x) splits over K, we can reasonably say that K contains all the roots 


of f(x). The next step is to consider the smallest extension field that contains all the 
roots of f(x). 


Definition lf F is a field and f(x) €FL[x], then an extension field K of F is said to be a 
splitting field (or root field) of f(x) over F provided that 


(i) f(x) splits over K, say f(x) = c(x — UX — Us) +> + (X — Gy); 
(ii) K = Fug, Us, ... tty). 


EXAMPLE 1 


If x? + 1 is considered as a polynomial in R[x], then € is a splitting field since 
+ 1 = (x + Xx -— din Cx] and € = RÒ = RG, —#. Similarly, Q( v2) is a splitting 


(Copyright 2012 Cangnge Lamrning All Rights Ramrwet Mag not ba Dpi, erann, or dapticaned, in whale or ia part Dus 10 electronic cights, ane third party content may te axppramed fren the eBook rador eChagher(e). Edinmriai eeview has 
Geman that any capped oati does not mammially effect the oveni lamning anperiamce Cengage Learning masrves the right to cncmnve atididonal coatesi at 689 tine if atamam) xiptts comricticms require it. 


11.4 Splitting Fields 389 


field of the polynomial x? — 2 in Q[x] since x? — 2 = (x + V2)(x — V2) and 
Q(v2) = Q(v2, - v2). 


EXAMPLE 2 
The polynomial f(x) = x* — x? — 2 in Q[x] factors as (x? — 2)(x* + 1), so its 


roots in € are + V2 and + i. Therefore, Q( V2, i) is a splitting field of f(x) 
over Q. 


EXAMPLE 3 


Every first-degree polynomial ex + din F[x] splits over F since ex + d = 
(x — (—e'd)) with -e“'de F. Obviously, F is the smallest field containing both 
Fandc“'d, that is, F = F(e~'d). So Fitself is the splitting field of ex + d over F. 


EXAMPLE 4 


The concept of splitting field depends on the polynomial and the base field. For 
instance, C is a splitting field of x? + 1 over R but not over Q because Ĉ is not 
the extension Q(i, — = Q(i). See Exercise 1 for a proof. 


At this point we need to answer two major questions about splitting fields: Does 
every polynomial in F[x] have a splitting field over F? If it has more than one splitting 
field over F, how are they related? 

The informal answer to the first question is easy. Given f(x) € F [x], we can find an 
extension F(u) that contains a root u of f(x) by Corollary 5.12. By the Factor Theorem 
in F(u)[x], we know that f(x) = (x — u)g(x). By Corollary 5.12 again there is an exten- 
sion F(u)(v) of F(u) that contains a root v of g(x). Continuing this, we eventually get a 
splitting field of f(x). We can formalize this argument via induction and prove slightly 
more: 


Theorem 11.13 


Let F be a field and f(x) a nonconstant polynomial of degree n in F(x}. Then 
there exists a splitting field K of f(x) over F such that [K:F] < nl. 


Proof» The proof is by induction on the degree of f(x). If f(x) has degree 1, 
then F itself is a splitting field of f(x) and [FF] = 1 = 1!. Suppose 
the theorem is true for all polynomials of degree z — 1 and that f(x) 
has degree n. By Theorem 4.14 f(x) has an irreducible factor in Fx] 
Multiplying this polynomial by the inverse of its leading coefficient 
produces a monic irreducible factor p(x) of f(x). By Theorem 5.11 
there is an extension field that contains a root u of p(x) (and, hence, 
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of #(x)). Furthermore, p(x) is necessarily the minimal polynomial of u. 
Consequently, by Theorem 11.7 [F(u) : F] = deg p(x) = deg f(x) = n. 
The Factor Theorem 4.16 shows that f(x) = (x — u)g(x) for some g(x) 
E Ftu)[x]. Since g(x) has degree n — 1, the induction hypothesis guar- 
antees the existence of a splitting field K of g(x) over F(z) such that 
[K:Flu)] = (n — 1)!. In K[x], 


B(x) = C(x — uy)(x — uy) +++ Ge — th) 
and, hence, f(x) = e(x — u)(x — u) + ++ (x — u1). Since 


K = Ful(u,..., Up = Flu, ty, . . +s ta) 


we see that K is a splitting field of f(x) over F such that [K:F] = [K-F(u)] 
[F@):F] = ((n — 1)!)n = n!. This completes the inductive step and the 
proof of the theorem. W 


The relationship between two splitting fields of the same polynomial is quite easy 
to state: 


Any two splitting fields of a polynomial in F|x| are isomorphic. 


Surprisingly, the easiest way to prove this fact is to prove a stronger result of which 
this is a special case. 


Theorem 11.14 


Let o:f — E be an isomorphism of fields, f(x) a nonconstant polynomial in 
Fix], and of{x) the corresponding polynomial in E[x]. If K is a splitting field of 
f(x) over F and L is a splitting field of of(x) over E, then o extends to an 
isomorphism K = L. 


If F = E andpg is the identity map F — F, then the theorem states that any two 
splitting fields of f(x) are isomorphic. 


Proof of Theorem 11.14> The proof is by induction on the degree of f(x). If 
deg f(x) = 1, then by the definition of splitting field f(x) = c(x — u) in 
K [x] and K = F(u). But f(x) = ex — cu is in F[x], so we must have c 
and cu in F. Hence, = c™eu is also in F. Therefore, K = Flu) = F. On 
page 380 we saw that g extends to an isomorphism F[x] = E[x]; hence, 
of(x) also has degree 1, and a similar argument shows that E = L. In 
this case, g itself is an isomorphism with the required properties. 
Suppose the theorem is true for polynomials of degree n — 1 and that 
f(x) has degree n. As in the proof of Theorem 11.13, f(x) has a monic 
irreducible factor p(x) in F[x] by Theorem 4.14. Since g extends to an 
isomorphism F[x] = E[x], (page 380), op(x) is a monic irreducible factor 
of of (x) in E[x]. Every root of p(x) is also a root of f(x), so K contains 
all the roots of p(x), and similarly L contains all the roots of ap(x). Let 
u be a root of p(x) in K and v a root of a p(x) in L. Then o extends to an 
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isomorphism F(u) > E(w) that maps u to v by Corollary 11.8, and the 


situation looks like this: 
K L 
wi a w 
Flu) -= Ko) 


The Factor Theorem 4.16 shows that f(x) = (x — u)g(x) in F(u)[x] and, 
hence, in E(v)[x] 


af(x) = alx — ujag(x) = (x — wu)og(x) = (x — v)og(x). 


Now f(x) splits over K, say f(x) = q(x — u)(x — u) +++ (x — u). 

Since f(x) = (x — u)g(x), we have g(x) = c(x — uz) * + +(x — u,). The 
smallest subfield containing all the roots of g(x) and the field F(z) is 

F(u, uz ..., Up) = K, so Kis a splitting field of g(x) over F(u). Similarly, 
Lis a splitting field of ag(x) over E(v). Since g(x) has degree n — 1, the 
induction hypothesis implies that the isomorphism F(u) = E(w) can be 
extended to an isomorphism K = L. This completes the inductive step 
and the proof of the theorem. E 


A splitting field of some polynomial over F contains all the roots of that poly- 
nomial by definition. Surprisingly, however, splitting fields have a much stronger 
property, which we now define. 


Definition An algebraic extension field K.of F is normal provided that whenever an 
irreducible polynomial in Ax] has one root in K, then it splits over K {that 


is, has all its roots in K). 


Theorem 11.15 


The field K is a splitting field over the field F of some polynomial in Ax] if and 
only if K is a finite-dimensional, normal extension of F. 


Proof» if Kisa splitting field of f(x) € F[x], then K = F(u;, . . . , u,), where the 
u; are all the roots of f(x). Consequently, [K-F) is finite by Theorem 11.10. 
Let p(x) be an irreducible polynomial in F[x] that has a root vin K. 
Consider p(x) as a polynomial in K]x] and let L be a splitting field of 
p(x) over K, so that F< K& L. To prove that p(x) splits over K, we need 
only show that every root of p(x) in L is actually in K. 

Let we L be any root of p(x) other than v. By Corollary 11.8 (with 

E = F and ø the identity map), there is an isomorphism F(v) = F(w) that 
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maps v to w and maps every element of F to itself. Consider the subfield 
K(w) of L; the situation looks like this: 


K K(w) 
UI UI 
Flv) = Fw) 
UI UI 
F = EF 


Since 


K(w) = Ru, .. . , u,)(w) = Fuj, - ..., tty, w) = Rw)lus . - . 5 Uy) 


we see that K(w) is a splitting field of f(x) over F(w). Furthermore, since 
vE Kand Kis a splitting field of f(x) over F, K is also a splitting field 

of f(x) over the subfield F(v). Consequently, by Theorem 11.14 the iso- 
morphism F(v) = F(w) extends to an isomorphism K —> K(w) that maps 
v to w and every element of F to itself. Therefore, [K:F] = [K(w):F] by 
Theorem 11.5. In the extension chain F S K & K(w), [K(w):K] is finite by 
Theorem 11.7 and [K:F] is finite by the remarks in the first paragraph of 
the proof. So Theorem 11.4 implies that 


[K:F] = [K(w):F] = [Klw):K]K:F]. 


Canceling [K:F] on each end shows that [K(w):K] = 1, and, therefore, 
K(w) = K. But this means that w is in K. Thus every root of p(x) in L is 
in K, and p(x) splits over K. Therefore, K is normal over F. 

Conversely, assume K is a finite-dimensional, normal extension of F 
with basis {u;,..., u„}. Then K = Kay, . . . , tip). Each u, is algebraic 
over F by Theorem 11.9 with minimal polynomial p{x). Since each p{x) 
splits over K by normality, f(x) = p,(x) + > + p,(2) also splits over K. 
Therefore, K is the splitting field of f(x). E 


EXAMPLE 5 


The field a(v2) contains the real root Y2 of the irreducible polynomial 

x3 — 2€ Q[x] but does not contain the complex root W/2w (as described in 
Example 7 of Section 11.2). Therefore, Q(v2) is not anormal extension of Q 
and, hence, cannot be the splitting field of any polynomial in Q[x]. 


At this point it is natural to ask if a field F has an extension field over which 
every polynomial in F[x] splits. In other words, is there an extension field that 
contains all the roots of all the polynomials in F[x]? The answer is “yes,” but the 
proof is beyond the scope of this book. A field over which every nonconstant 
polynomial splits is said to be algebraically closed. For example, the Fundamental 
Theorem of Algebra and Corollary 4.28 show that the field C of complex numbers 
is algebraically closed. 
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If K is an algebraic extension of Fand K is algebraically closed, then K is called the 
algebraic closure of F. The word “the” is justified by a theorem analogous to Theorem 
11.14 that says any two algebraic closures of F are isomorphic. For example, C is the 
algebraic closure of R since C = R(i) is an algebraic extension of R that is algebraically 
closed. The field C is not the algebraic closure of Q, however, since C is not alge- 
braic over Q. The subfield E of algebraic numbers (see Example 7 of Section 11.3) 
is the algebraic closure of Q (Exercise 20). 


@ Exercises 


NOTE: F is a field. 


A. 1. Show that V2is not in Q(i) and, hence, C # Q(i). [Hint: Show that V2 = a + bi, 
with a, be Q, leads to a contradiction.] 


2. Show that x? — 3 and x? — 2x — 2 are irreducible in Q[x] and have the same 
splitting field, namely Q( V3). 


3. Find a splitting field of x* — 4x? — 5 over Q and show that it has dimension 
4 over Q. 


4. If f(x) € Rix], prove that R or € is a splitting field of f(x) over R. 


5. Let K bea splitting field of f(x) over F. If E is a field such that Fe ESK, 
show that K is a splitting field of f(x) over E. 


6. Let K bea splitting field of f(x) over F. If [K:F] is prime, u € K is a root of 
f(x), and u € F, show that K = F(u). 


7. If u is algebraic over Fand K = F(z) isa normal extension of F, prove that K 
is a splitting field over F of the minimal polynomial of u. 


8. Which of the following are normal extensions of Q? 
(a) Q(V3) (b) Q(W3) © QV, ’) 

9. Prove that no finite field is algebraically closed. [Hint: If the elements of the 
field Fare a}, . . . , an, with a; nonzero, consider 
a + (x -ax — a) +++ (x — a) E F[x]] 


B. 10. By finding quadratic factors, show that Q( V2, V3) is a splitting field of 
xt + 2x? — 8x7 — 6x — 1 over Q. 


11. Find and describe a splitting field of x* + 1 over Q. 
12. Find a splitting field of x* — 2 
(a) over Q. (b) over R. 
13. Find a splitting field of xf + x? + 1 over Q. 
14. Show that Q(-V2, i) is a splitting field of x* — 2V2x + 3 over Q( V2). 
15. Find a splitting field of x? + 1 over Z3. 
16. Find a splitting field of x? + x + 1 over Z}. 
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17. If K is an extension field of F such that [K:F] = 2, prove that K is normal. 


18. Let F, E, K be fields such that FC EG K and E = F(w,..., u,), where the u; 
are some of the roots of f(x) € F[x]. Prove that Kis a splitting field of f(x) 
over F if and only if K is a splitting field of f(x) over E. 


19. Prove that the following conditions on a field K are equivalent: 
(i) Every nonconstant polynomial in K[x] has a root in K. 


(ii) Every nonconstant polynomial in K[x] splits over K (that is, K is 
algebraically closed). 


(iii) Every irreducible polynomial in K[x] has degree 1. 
(iv) There is no algebraic extension field of K except K itself. 


20. Let K be an extension field of F and E the subfield of all elements of K that 
are algebraic over F, as in Corollary 11.12. If K is algebraically closed, prove 
that E is an algebraic closure of F. [The special case when F = Q and K = € 
shows that the field E of algebraic numbers is an algebraic closure of ©] 


21. Let K be an algebraic extension field of F such that every polynomial in F(x) 
splits over K. Prove that K is an algebraic closure of F. 


C.22. If Kis a finite-dimensional extension field of F and ¢:F — Kis a homomorphism 
of fields, prove that there exists an extension field L of K and a homomorphism 
7:K — L such that 7(a) = g(a) for every aE F. 


23. Prove that a finite-dimensional extension field K of F is normal if and only if 
it has this property: Whenever L is an extension field of K and g:K—> L an 
injective homomorphism such that g (e) = e for every c E F, then o(K) & K. 


pg Separability 


Every polynomial has a splitting field that contains all its roots. These roots may all be 
distinct, or there may be repeated roots.* In this section we consider the case when the 
roots are distinct and use the information obtained to prove a very useful fact about 
finite-dimensional extensions. 

Let F be a field. A polynomial f(x) € F[x] of degree n is said to be separable if it 
has n distinct roots in some splitting field.t Equivalently, f(x) is separable if it has no 
repeated roots in any splitting field. If K is an extension field of F, then an element 
uE K is said to be separable over F if u is algebraic over F and its minimal polynomial 
p(x) € F[x] is separable. The extension field K is said to be a separable extension (or to 
be separable over F) if every element of K is separable over F. Thus a separable exten- 
sion is necessarily algebraic. 


*A repeated root occurs when f(x) = (x — u) +++ (x — u,) in the splitting field and some u; = ùj 
with i+ j. 

tSince any two splitting fields are isomorphic, this means that f(x) has n distinct roots in every 
splitting field. 
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EXAMPLE 1 


The polynomial x? + 1 € Q[x] is separable since it has distinct roots i and —i 
in C. But f(x) = x4 — x? — x + 1is not separable because it factors as 

— IP (7 + x + 1). Hence, f(x) has one repeated root and a total of three 
distinct roots in C. 


There are several tests for separability that make use of the following concept. The 
derivative of 


JA) = co Hex + x? + +++ + ee Fx] 
is defined to be the polynomial 
J'E) = ci + 2eax + 3e + +++ + nep! Flx]*. 


You should use Exercises 4 and 5 to verify that derivatives defined in this algebraic 
fashion have these familiar properties. 


(f+ g)'(x) =F) + 8’) 
(fa)' (=) = FON C) + fg). 


Lemma 11.16 


Let F bea field and f(x) € F[x]. if f(x) and f'(x) are relatively prime in F[x], then 
f(x) is separable. 


Note that the lemma operates entirely in F[x] and does not require any knowl- 
edge of the splitting field to determine separability. For other separability criteria, see 
Exercises 8— 10. 


Proof of Lemma 11.16 > we shall prove the contrapositive: If f(x) is not separable, 
then f(x) and f'(x) are not relatively prime (which is logically equivalent 
to the statement of the theorem)? Let K be a splitting field of f(x) and 
suppose that f(x) is not separable. Then f(x) must have a repeated root u 
in K. Hence, f(x) = (x — u)*g(x) for some g(x) € K[x] and 


F(X) = E = Yg + 2x — u)g(x). 


Therefore, f'(u) = 0,¢'(u) + Oglu) = 0, and v is also a root of f'(x). If 
P(x) E F[x] is the minimal polynomial of u, then p(x) is nonconstant and 
divides both f(x) and f’(x). Therefore, f(x) and f'(x) are not relatively 
pime E 


*When F = R, this is the usual derivative of elementary calculus. But our definition is purely algebraic 
and applies to polynomials over any field, whereas the limits used in calculus may not be defined in 
some fields. 

tSee Appendix A (pages 503, 504 and 506) for the definition and use of the contrapositive in proofs. 
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Recall that for a positive integer n and c E F, 
ne is the element c + c + ++ > + c (n summands). 


A field F is said to have characteristic 0 if n1; # Oç for every positive n. For example, 
Q, R, and C all have characteristic 0, but Z, does not (since 3 + 1 = 0 in Z;). Every 
field of characteristic 0 is infinite (Exercise 3). If F has characteristic 0, then for every 
positive and ce F, 


no=ct+++++eH=(Ipt:': + Ipe=(nipe with nly # Op. 


So nc = Op if and only if c = Op This fact is the key to separability in fields of char- 
acteristic 0: 


Theorem 11,17 


Let F be afield of characteristic 0. Then every irreducible polynomial in F[x] is 
separable, and every algebraic extension field K of F is a separable extension. 


The theorem may be false if F does not have characteristic 0 (Exercise 15). 
Proof of Theorem 11.17 > An irreducible P(x) € F[x] is nonconstant and, hence, 
P(x) = cx” + (lower-degree terms), with c + Opandn= 1. 
Then 
P'(x) = (ne)x""! + (lower-degree terms), with nc # Op. 


Therefore, p’(x) is a nonzero polynomial of lower degree than the 
irreducible p(x). So p(x) and p’(x) must be relatively prime. Hence, p(x) 
is separable by Lemma 11.16. In particular, the minimal polynomial of 
each u € K is separable. So K isa separable extension. W 


Separable extensions are particularly nice because every finitely generated (in 
particular, every finite-dimensional) separable extension is actually simple: 


Theorem 11, 18* 


If K is a finitely generated separable extension field of F, then K = F(u) for 
some ueK, 


Proof» By hypothesis K = F(t, ... , u„). The proof is by induction on n. There 
is nothing to prove when n = 1 and K = F({u)). In the next paragraph we 
shall show that the theorem is true for n = 2. Assume inductively that it 
is true for n = k — 1 and suppose n = k. By induction and the case n = 2, 
there exist ¢, u E K such that 


K= Rup... ug) = Flay, . «+ 5 Ugu) = FO(u) = i, ty) = Flu). 
*This theorem will be used only in Section 12.2. 
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To complete the proof, we assume K = F(v, w) and show that K is 
a simple extension of F. Assume first that F is infinite (which is always 
the case in characteristic 0 by Exercise 3). Let p(x) € F[x] be the minimal 
polynomial of v and g(x) € F[x] the minimal polynomial of w. Let L bea 
splitting field of p(x)q(x) over F. Let w = w, wz . .., Wa be the roots of 
q(x) in L. By the definition of separability, all the w, are distinct. Let 
U = Uj, Ug. «+5 Um be the roots of p(x) in L. Since Fis infinite, there exists 
cE Fsuch that 


a 


(+) c# foralll sism,1<jsn. 


WwW — w 
Let u = v + cw. We claim that K = Hu). To show that we F(u), let 
A(x) = plu — cx) E€ F(u)[x] and note that w is a root of A(x): 


h(w) = plu — cw) = plo) = Op 
Suppose some w (with j # 1) is also a root of A(x). Then plu — ew) = 
0z so that u — cwyis one of the roots of p(x), say u — cw, = v, Since 


u =v + cw, we would have 


A yv 
v + cw — cw,= v, or, equivalently, c= 


wW— w; 

This contradicts (+). Therefore, w is the only common root of g(x) and A(x). 
Let r(x) be the minimal polynomial of w over F(u). Then r(x) 

divides g(x), so that every root of r(x) is a root of g(x). But r(x) also 

divides A(x), so all its roots are roots of A(x). By the preceding para- 

graph, r(x) has a single root w in L. Therefore, r(x) € F(u)[x] must have 

degree 1, and, hence, its root w is in Hu). Since v = u — cw, with u, 

w E F(u), we see that v E F(u) and, hence, K = Flv, w) & Fu). But 

u = v + cwE K, so Ru) & K, whence K = Ru). This completes the 

proof when Fis infinite. For the case of finite F, see Theorem 11.28 in 

the next section. W 


EXAMPLE 2 


Applying the proof of the theorem to Q( V3, V5), we have v = V3, v, = —V3, 
w = V5, un = — V5, so we can choose ¢ = 1. Then u = V3 + V5 and 
Q(V3, V5) is the simple extension Q( V3 + V5). 


E Exercises 


NOTE: K is an extension field of the field F. 


A. 1. If Kis separable over F and E is a field with F& E & K, show that K is 
separable over E. 


2. If F has characteristic 0, show that K has characteristic 0. 
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3. 


10. 


11. 


12. 


Field Extensions 


Prove that every field of characteristic 0 is infinite. [Hint: Consider the elements 
nipwithneZ,n>0.] 


. If f(x), a(x) E F[x], prove 


(a) f+ 8)'(x) =f") + g'(x). 
(b) If cE F, then (cf)'(x) = ef’ (x). 


. (a) If f(x) = cx" e Fix] and g(x) = by + bx +++ + + b,x*  F[x], prove that 


(f2)'(x) = fg) + f'e). 


(b) If f(x), g(x) are any polynomials in F[x], prove that (fg) (x) = f(x)g’(x) + 
S'(x)g(x). [Hint: If f(x) = a + ajx +++ + + ax”, then (fg)(x) = age(x) + 
a,xg(x) +++ © + a,x"g(x); use part (a) and Exercise 4.] 


. If f(x) € F[x] and n is a positive integer, prove that the derivative of f(x)" is 


nf( f'(x). [Hint: Use induction on n and Exercise 5.] 


. (a) If F has characteristic 0, f(x) € F[x], and f'(x) = Op, prove that f(x) = c for 


some cE F. 


(b) Give an example in Z.[x] to show that part (a) may be false if F does not 
have characteristic 0. 


. Prove that x E K is a repeated root of f(x) € F[x] if and only if «is a root of 


both f(x) and f'(x). [Hint: f(x) = (x — u)"g(x) with m = 1, g(x) € K[x], and 
glu) # 0r, u is a repeated root of f(x) if and only if m > 1. Use Exercises 5 and 
6 to compute f '(x).] 


. Prove that f(x) € F[x] is separable if and only if f(x) and f’(x) are relatively 


prime. [Hint: See Lemma 11.16 and Exercise 8.] 

Let p(x) be irreducible in F[x]. Prove that p(x) is separable if and only if 
P'(x) # Op. 

Assume F has characteristic 0 and K isa splitting field of f(x) € F[x]. If d(x) 
is the greatest common divisor of f(x) and f(x) and A(x) = f(x)/d(x) € Fix], 
prove 


(a) f(x) and A(x) have the same roots in K. 
(b) A(x) is separable. 


Use the proof of Theorem 11.18 to express each of these as simple extensions 
of Q: 


(a) Q(-V2, V3) (b) Q(-V3,4) (9 Q(-V2, V3, V5) 


. If p and q are distinct primes, prove that Q(Vp, V4) = Q( Vp ap va). 


. Assume that F is infinite, that v, w € K are algebraic over F, and that w is the 


root of a separable polynomial in F[x]. Prove that F{v, w) is a simple extension 
of F. [Hint: Adapt the proof of Theorem 11.18.] 


. Here is an example of an irreducible polynomial that is not separable. Let 


F = Z,(t) be the quotient field of Z,[7] (the ring of polynomials in 
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the indeterminate ¢ with coefficients in Z,), as in Example 1 of 

Section 10.4. 

(a) Prove that x* — t is an irreducible polynomial in F[x]. [Hint If x7 — t 
has a root in F, then there are polynomials g(t), A(t) in Z,[¢] such that 
[g(2)/A(OF = t; this leads to a contradiction; apply Corollary 4.19.] 


(b) Prove that x? — te F[x] is not separable. [Hint: Show that its derivative is 
zero and use Exercise 10.] 


116 Finite Fields 


Finite fields have applications in many areas, including projective geometry, combina- 
tories, experimental design, and cryptography. In this section, finite fields are charac- 
terized in terms of field extensions and splitting fields, and their structure is completely 
determined up to isomorphism. 

We begin with some definitions and results that apply to rings that need not be 
fields or even finite. But our primary interest will be in their implications for finite 
fields. 

Let R be a ring with identity. Recall that for a positive integer m and cE R, mc is 
the element c + c + » - - + c (m summands). The ring R is said to have characteristic 0 if 
mp # Op for every positive m. On the other hand, if mlg = Op for some positive m, 
then there is a smallest such m by the Well-Ordering Axiom. Then R is said to have 
characteristic # if n is the smallest positive integer such that nlg = 0g.* For example, Q 
has characteristic 0 and Z, has characteristic 3. 


Lemma 11.19 


If R is an integral domain, then the characteristic of R is either 0 or a positive 
prime. 


Proof» If R has characteristic 0, there is nothing to prove. So assume R has 
characteristic n > 0. If n were not prime, then there would exist positive 
integers k, t such that n = kt, with k < n and t < n. The distributive laws 
show that 


(KIDEA = (lr +e + lr) (In + +++ + 12) 
i fo 
= lklr +--+ + Iplp=Ipt:>:+ 1p [kt summands] 
= (kd)1p = nlg = Op 


*If you have read Chapter 7, you will recognize that when the characteristic of R is positive, it is 
simply the order of the element 1, in the additive group of R. 
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Since R is an integral domain either klp = Op or tlg = Op, contradict- 
ing the fact that n is the smallest positive integer such that nlp = Op. 
Therefore, n is prime. W 


Lemma 11.20 


Let R be a ring with identity of characteristic n > 0. Then k1g = Op if and only 
if n |k.* 


Proof» If n |k, say k = nd, then klg = ndlp = (nla)(dla) = Or (dlp) = Op 
Conversely, suppose k1p = Opg. By the Division Algorithm, k = ng + r 
with 0 = r < n. Now nl, = Op, so that 


rlp = rlr + Og = rig + nglg = (r + ng)lz = klg = Opg. 


Since r < n and n is the smallest positive integer such that nlp = Op by 
the definition of characteristic, we must have r = 0. Therefore, k = ng 
andn|k. m 


Theorem 11.21 


Let R be a ring with identity. Then 
(1) The set P = {k1g|k E Z} is a subring of R. 
(2) If R has characteristic 0, then P = Z. 
(3) FR has characteristic n > 0, then P = Zp. 


Proof» Define f:Z > R by f(k) = k1p. Then 
fik+ A= (k+ tlr = klg t tlg = fk) + fA). 
The distributive laws (as in the proof of Lemma 11.19) show that 


Jiki) = (kt)lr = (klat) = SMSO. 


Therefore, fis a homomorphism. The image of fis precisely the set P, 
and, therefore, P is a ring by Corollary 3.11. Consequently, fcan be con- 
sidered as a surjective homomorphism from Z onto P. Then P = Z/(Ker f) 
by the First Isomorphism Theorem 6.13. If R has characteristic 0, then 
the only integer k such that k1p = Op is k = 0. So the kernel of fis the 
ideal (0) in Z, and P = Z/(0) = Z. If R has characteristic n > 0, then 
Lemma 11.20 shows that the kernel of fis the principal ideal (n) consist- 
ing of all multiples of n. Hence, P= Z/(n) = Z,. E 


*This lemma is just a special case (in additive notation) of part (1) of Theorem 7.9, with a = 1g and 


e = Op. 
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According to Theorem 11.21 a field of characteristic 0 contains a copy of Z and, 
hence, must be infinite. Therefore, by Lemma 11.19 we have 


Corollary 11.22 


Every finite field has characteristic p for some prime p. 


The converse of Corollary 11.22 is false, however, since there are infinite fields of 
characteristic p (Exercise 8). 

If K isa field of prime characteristic p (in particular, if K is finite), then Theorem 11.21 
shows that K contains a subfield P isomorphic to Z,. This field P is called the prime 
subfield of K and is contained in every subfield of K (because every subfield contains 
1zand, hence, contains tlx for every integer #).* See Exercise 4 for another description 
of P. We shall identify the prime subfield P with its isomorphic copy £,; then 


every field of characteristic p contains Zy 


The number of elements in a finite field K is called the order of K. To determine the 
order of a finite field K of characteristic p, we consider K as an extension field of its 
prime subfield Z,: 


Theorem 11,23 


A finite field K has order p”, where p is the characteristic of K and n = [K: 2]. 


Proof» There is certainly a finite set of elements that spans K over Z, (the set K 
itself, for example). Consequently, by Exercise 32 of Section 11.1, K has 
a finite basis {t, up,..., u,}over Zp. Every element of K can be written 
uniquely in the form 


(*) Cyuy + Cyn ++ ** + Cyt, 


with each ¢,€ £Z, by Exercise 30 of Section 11.1. Since there are exactly p 
possibilities for each c; there are precisely p“ distinct linear combinations 
of the form (+). So K has order p", with n = number of elements in the 
basis = [K:Z,]. m 


Theorem 11.23 limits the possible size of a finite field. For instance, there can- 
not be a field of order 6 since 6 is not a power of any prime. It also suggests several 
questions: Is there a field of order p” for every prime p and every positive integer n? 


*lf K has characteristic 0, then K contains an isomorphic copy P of Z. Since K contains the 
multiplicative inverse of every nonzero element of P, it follows that K contains a copy of the field 
Q. As in the case of characteristic p, this field (called the prime subfield) is contained in every 
subfield of K. See Theorem 10.31 (with R = P = Z and F = Q) for a more precise statement and proof. 
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How are two fields of order p" related? The answers to these questions are given in 
Theorem 11.25 and its corollaries. In order to prove that theorem, we need a techni- 
cal lemma. 


Lemma 11.24 The Freshman’s Dream* 


Let p be a prime and R a commutative ring with identity of characteristic p. 
Then for every 2, DER and every positive integer n, 


(a +b) = a” +b”. 


Proof» The proof is by induction on n. If n = 1, then the Binomial Theorem in 
Appendix E shows that 


atoy = ei Peor a Cw 
t ( P jete. 
p-—l 


Each of the middle coefficients (? ) = is an integer by 


= = 
ri(p — r)! 

Exercise 6 in Appendix E. Since every term in the denominator is strictly 

less than the prime p, the factor of p in the numerator does not cancel, and, 


therefore, (? ‘i is divisible by p, say 4 ) = tp. Since R has characteristic p, 


p Jee = tpipa?" = t(plp)P -E = 0g?’ = Op. 


Thus all the middle terms are zero and (a + bY = œ + BP. So the theo- 
rem is true when n = 1. Assume the theorem is true when z = k. Using 
this assumption and the case when n = 1 shows that 


(a + bf" =((a+ 5)" 
= (a? + bY = (a? + (aP = a” + BP. 


Therefore, the theorem is true when n = k + 1 and, hence, for all n by 
induction. E 


*Terminology due to Vincent O. McBrien. 
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Theorem 11.25 


Let K be an extension field of Z, and n a positive integer. Then K has order 
p" if and only if K is a splitting field of x” — x over Zp. 


Proof» Assume Kis a spline field of f(x) = x” — xE Z,(x). Since 
f'(x) = pro" - 1 = 08! - 1 =-1 Fo is separable by 
Lemma 11.16. Let E be the subset of K consisting of the p” distinct roots 
of x” — x. Note that c€ E if and only if c” = c. We shall show that the 
set E is actually a subfield of K. If a, be E, then by Lemma 11.24. 


(a+ bP =d + ath. 


Therefore, a + bE E, and E is closed under addition. The set E is closed 
under multiplication since (ab)” = "W = ab. Obviously, Ox and 1, are 
in E. If ais a nonzero element of E, then —a and a“ are in E because, 
for example, 


(a =a = (P= 0" 


The argument for —a is similar (Exercise 7), Therefore, E is a subfield of 
K. Since the splitting field K is the smallest subfield containing the set E 
of roots, we must have K = E. Therefore, K has order p". 

Conversely, suppose K has order p". We need only show that every ele- 
ment of K is a root of x” — x, for in that case, the p" distinct elements of 
Kare all the possible roots and K is a splitting field of x*” — x.” Clearly 0, 
is a root, so let c be any nonzero element of K. Let c4, Cz, . . . , Cp be all the 
nonzero elements of K (where k = p" — 1 and cis one of the c,)) and let u be 
the product u = c,c,¢3 * * + cg. The k elements ccj, cc, . . . , cc, are all dis- 
tinct (since cc; = cc, implies c = c), so they are just the nonzero elements 
of K in some other order, and their product is the element u. Therefore, 


u = (ce,)(cen) « * > (ccp) = Elec; + > * Gy) = eu. 


Canceling u shows that c* = 1, and, hence, c**! = c, or equivalent 
ck! — ¢ = Op Since k + 1 =p", cisaroot of x” — x. E 


Theorem 11.25 has several important consequences; together with the theorem 
they provide a complete characterization of all finite fields 


Corollary 11.26 


For each positive prime p and positive integer n, there exists a field of order p”. 


Proof» A splitting field of x” — x over Z, exists by Theorem 11.13; it has order 
P 
p" by Theorem 11.25 W 


*A short proof, using group theory, is given in Exercise 22. 
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Corollary 11.27 


Two finite fields of the same order are isomorphic. 


Proof» If K and L are fields of order p", then both are splitting fields of x” — x 
over Z, by Theorem 11.25 and, hence, are isomorphic by Theorem 11.14 
(with g the identity map on Z,). E 


According to Corollary 11.27, there is (up to isomorphism) a unique field of order 
P. This field is called the Galois field of order p”. We complete our study of finite fields 
with two results whose proofs depend on group theory. 


Theorem 11,28 


Let K be a finite field and F a subfield. Then K is a simple extension of F. 


Proof » By Theorem 7.16 the multiplicative group of nonzero elements of 
K is cyclic. If u is a generator of this group, then the subfield F(u) 
contains 0; and all powers of u and, hence, contains every element of K. 
Therefore, K = Hu). E 


Corollary 11.29 


Let p be a positive prime. For each positive integer n, there exists an 
irreducible polynomial of degree n in Z,[x]. 


Proof» There is an extension field K of Z, of order p” by Corollary 11.26. By 
Theorem 11.28, K = Z,(u) for some uE K. The minimal polynomial of u 
in Z, [x] is irreducible of degree [K:Z,] by Theorem 11.7. Theorem 11.23 
shows that [K:Z,]=n. m 


E Exercises 


A. 1. If Risa ring with identity and m, n E Z, prove that (ml ,)(n1p) = (mn) 13. 
[The case of positive m, n was done in the proof of Lemma 11.19.] 


2. What is the characteristic of 
(a) Q (b) 2, X Z; (c) Z,[x] 
(d MR) (e) MZ) 


3. Let R be a ring with identity of characteristic n = 0. Prove that na = Op for 
every aE R. 


4. If Kis a field of prime characteristic p, prove that its prime subfield is the 
intersection of all the subfields of K. 
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5. Let F be a subfield of a finite field K. If F has order q, show that K has order 
d", where n = [K:F]. 


6. Show that a field K of order p” contains all kth roots of 1x, where k = p” — 1. 


7. Let E be the set of roots of x?" — x €Z,[x] in some splitting field. If ae E, 
prove that —a E E. 
B. 8. Letp be prime and let Z,(x) be the field of quotients of the polynomial ring 
Z,[x] (as in Example 1 of Section 10.4). Show that Z,(x) is an infinite field of 
characteristic p. 


9. Let R be a commutative ring with identity of prime characteristic p. If a, 
be Rand na 2 1, prove that (a — 5)” = a” — b”. 


10. Let K be a finite field of characteristic p. Prove that the map f:K — K given by 
f(a) = æ is an isomorphism. Conclude that every element of K has a pth root 
in K. 

11. Show that the Freshman’s Dream (Lemma 11.24) may be false if the 
characteristic p is not prime or if R is noncommutative. [Hint: Consider Z6 
and M(Z,).] 

12. If c is a root of f(x) € Z,[x], prove that œ is also a root. 


13. Prove Fermat's Little Theorem: If pis a prime and a EZ, then a@ =a (mod p). If 
a is relatively prime to p, then œ~! = 1 (mod p). [Hint: Translate congruence 
statements in Z into equality statements in Z, and use Theorem 11.25.] 

14. Let F be a field and f(x) a monic polynomial in F[x], whose roots are all 
distinct in any splitting field K. Let E be the set of roots of f(x) in K. If the set 
Eis actually a subfield of K, prove that F has characteristic p for some prime p 
and that f(x) = x” — x for some n = 1. 


15. (a) Show that xX? + x + 1 is irreducible in Z,[x] and construct a field of 
order 8. 


(b) Show that x? — x + 1 is irreducible in Z,[x] and construct a field of order 27. 


(c) Show that x* + x + 1 is irreducible in Z,[x] and construct a field of 
order 16. 


16. Let K be a finite field of characteristic p, F a subfield of K, and m a positive 
integer. If L = {a € K| a?” € F}, prove that 


(a) Lisa subfield of K that contains F. 


(b) L = F. [Hint: Use Exercise 10 to show that the map g:K — K given by 
g(a) = d isan isomorphism such that g( F) = F. Whatis g`! (F) 
17. If E and Fare subfields of a finite field K and E is isomorphic to F, prove that 
E=F. 
18. Let K be a field and k, n positive integers. 
(a) Prove that x* — 1, divides x" — 1, in K[x] if and only if k |n in Z. 
[Hint: n = kq + r by the Division Algorithm; show that x” — 1, = 
(oc — 1 Ax) + (x? — 1x), where A(x) = 2° * + x + ee + te] 
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(b) If p = 2 is an integer, prove that (7% — 1)|(p" — 1) if and only if k | n. 
[Hint: Copy the proof of part (a) with p in place of x.] 


19, Let K be a finite field of order p”. 


(a) If Fis a subfield of K, prove that F has order p*for some d such that d | n. 
[Hint: Exercise 18 may be helpful.] 


(b) H d |n, prove that K has a unique subfield of order p”. [Hint: See Exercise 17 
and Corollary 11.27 for the uniqueness part.] 


20. Let p be prime and f(x) an irreducible polynomial of degree 2 in Z,[-]. If K is 
an extension field of Z, of order p, prove that f(x) is irreducible in K[x]. 


21. Prove that every element in a finite field can be written as the sum of two 
squares. 


22. Use part (2) of Corollary 8.6 to prove that every nonzero element c of a finite 


field K of order p" satisfies ec”! = 1x. Conclude that ¢ is a root of x?" — x 
and use this fact to prove Theorem 11.25. 


Ap p lication BCH codes (Section 16.3) may be covered at this point if desired. 
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CHAPTER 12 


Galois Theory 


A major question in classical algebra was whether or not there were formulas for 
the solution of higher-degree polynomial equations (analogous to the quadratic 
formula for second-degree equations). Although formulas for third- and fourth- 
degree equations were found in the sixteenth century, no further progress was 
made for almost 300 years. Then Ruffini and Abel provided the surprising answer: 
There is no formula for the solution of alf polynomial equations of degree n when 
n = 5. This result did not rule out the possibility that the solutions of special types 
of equations might be obtainable from a formula. Nor did it give any clue as to 
which equations might be solvable by formula. 

It was the amazingly original work of Galois that provided the full explanation, 
including a criterion for determining which polynomial equations can be solved 
by a formula. Galois’ ideas had a profound influence on the development of later 
mathematics, far beyond the scope of the original solvability problem. 

The solutions of the equation f(x) = 0 lie in some extension of the coefficient 
field of f(x). Galois’ remarkable discovery was the close connection between such 
field extensions and groups (Section 12.1). A detailed description of the connec- 
tion is given by the Fundamental Theorem of Galois Theory in Section 12.2. This 
theorem is the principal tool for proving Galois’ Criterion for the solvability of 
equations by formula (Section 12.3). 


121 The Galois Group 


The key to studying field extensions is to associate with each extension a certain group, 
called its Galois group. The properties of the Galois group and theorems of group 
theory can then be used to establish important facts about the field extension. In this 
section we define the Galois group and develop its basic properties. Throughout this 
section F is a field. 


407 
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Definition Let K be an extension field of F. An F-automorphism of K is an isomor- 
phism o:K — K that fixes F elementwise (that is, ¢(c) = c for every cef}. 
The set of all F-automorphisms of K is denoted Gal-K and is called the 
Galois group of K over F. 


The use of the word “group” in the definition is justified by: 


Theorem 12.1 


If K is an extension field of F, then Gal,;K is a group under the operation of 
composition of functions. 


Proof» Gal pK is nonempty since the identity map u:K > K is an automor- 
phism.* If g, TE GalpK then g ° T is an isomorphism from K to K 
by Exercise 27 of Section 3.3. For each c E F, (Ø ° TXA) = a(t(c)) = 
o(c) = c. Hence, g ° T E€ Gal;K, and Gal XK is closed. Composition of 
functions is associative, and the identity map « is the identity element of 
Gal,K. Every bijective function has an inverse function by Theorem B.1 
in Appendix B. If g € Gal,K, then 0’ is an isomorphism from K to K 
by Exercise 29 of Section 3.3. Verify that ¢~1{c) = c for every ce F 
(Exercise 1). Therefore, o~! € Gal;K, and GalpK isa group. W 


EXAMPLE 1.A? 


The complex conjugation map o:€ — C given by g(a + bi) = a — bi is an auto- 
morphism of C, as shown in Example 3 of Section 3.3. For every real number a, 


g(a) = gla + 0) =a — Oi =a. 


So ø isin GalgC. Note that i and —i are the roots of x? + 1 €R and that g maps 
these roots onto each other: o(#) = —i and o(—A) = i. This is an example of the 
next Theorem. 


Theorem 12.2 


Let K be an extension field of F and f(x) EF[x]. If ue K is a root of f(x) and 
a €Gal,K, then a(u) is also a root of f(x). 


*Throughout this chapter, « denotes the identity map on the field under discussion. 


tThroughoutthis section and the next, three basic examples appear repeatedly. The first appearance 
of Example 1 is labeled 1.A, its second appearance 1.B, etc.; the first appearance of Example 2 is 
labeled 2.A, and so on. 
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Proof» If f(x) = co + qx + op? + +++ + cpa”, then 
Co + cu + cw? + +++ + epu” = Op 
Since g is a homomorphism and g(c,) = c for each c,€ F, 


Or = G(0x) = (cg + ciu + cru? + +++ + cpu”) 
= ø(ca) + alcolu) + o(c)o(uy + +++ + of(c,o(uy" 
= + c0(u) + c20(u)? +--+ + eou = Kolu). 


Therefore, g(u) is a root of f(x). E 


Let u E K be algebraic over F with minimal polynomial p(x) € F[x]. Theorem 12.2 
states that every image of u under an automorphism of the Galois group must also be 
a root of p(x). Conversely, is every root of p(x) in K the image of u under some auto- 
morphism of Gal,K? Here is one case where the answer is yes. 


Theorem 12.3 


Let K be the splitting field of some polynomial over F and let u, ve K. Then 
there exists g € Gal;K such that o(u) = v if and only if u and v have the same 
minimal polynomial in F[x]. 


Proof» 1f u and v have the same minimal polynomial, then by Corollary 11.8 
there is an isomorphism g:F(u) = F(v) such that o(u) = v, and g fixes 
F elementwise. Since K is a splitting field of some polynomial over F, 
it is a splitting field of the same polynomial over both F(u) and F(v). 
Therefore, g extends to an F-automorphism of K (also denoted g) by 
Theorem 11.14. In other words, g € GalpK and o(u) = v. The converse is 
an immediate consequence of Theorem 12.2. WE 


EXAMPLE 1.B 


Example 1.A shows that Gal,C has at least two elements, the identity map + and 
the complex conjugation map g. We now prove that these are the only elements 
in GalgC. Let 7 be any automorphism in GalgC. Since iis a root of x? + 1, 

(i) = +i by Theorem 12.2. If 7(é) = i, then since 7 fixes every element of R, 


(a + bi) = 7(a) + r(b)r(i) = a + bi, 
and, hence, T = +. Similarly, if 7( = —i, then 
qla + bi) = t(a) + T(b)r(i) = a + b(—À = a — bi, 


and, therefore, T = g. Thus GalpC = {:, e} is a group of order 2 and, hence, 
isomorphic to Z, by Theorem 8.7. 
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The preceding example shows that an R-automorphism of C = R(i) is completely 
determined by its action on i. The same thing is true in the general case: 


Theorem 12.4 


Let K = A(uy,..., Un) be an algebraic extension field of F. tf o, TE Gal-K and 
alu) = tu) for each f= 1,2,..., N, then ø = T. In other words, an auto- 
morphism in Gal,K is completely determined by its action on uj, ..., Up: 


Proof» Let B = T7! ° g € Gal pK. We shall show that £ is the identity map z. 
Since g(u) = T(z, for every i, 


Bu) = (T ° 0) (u) = T'OU) = T'U) = (T ° Tu) = u= Uy. 


Let vE F(u). By Theorem 11.7 there exist cE F such that v = cy + cyu + cu? + 
+++ Cp-141”!, where m is the degree of the minimal polynomial of w. Since B is a 
homomorphism that fixes u, and every element of F, 


B©) = Bley + crn + cmt +--+ + cmt”) 
= Blc) + Bles)B(us) + BUA + ++ + Blom s)B(uy"-) 
= Co + Ciu + cra? +. C1 -1 = v 
Therefore, B(v) = v for every v E F (u). Repeating this argument with F(u) in place of 
Fand win place of u, shows that B(v) = v for every v E€ F(u)(u2) = F (u, uz). Another 
repetition, with F{u;, u2) in place of F and w, in place of u, shows that B(v) = v for 


every v E Fluy, uz, t). After a finite number of repetitions we have B(v) = v for every 
vE F(u un» ..., ty) = K, thatis, = 8 = T7! eg. Therefore, 


T=Ttor=r°o (t og) =(ToT Joo =r0g=0. M 


EXAMPLE 2.A 


By Theorem 12.2 any automorphism in the Galois group of Q(V3,V5) over Q 
takes V3 to V3 or — V3, the roots of x? — 3. Similarly, it must take V5 to 
+vV5, the roots of x? — 5. Since an automorphism is completely determined by 
its action on ‘V3 and V5 by Theorem 12.4, there are at most four automorphisms 
in GalgQ(V3,V5), corresponding to the four possible actions on V3 and V5: 


B>A Ab VIS viv 
VE—> VE VE VE VES VS VS i. 


We now show that GalgQ(V3,V5) is a group of order 4 by constructing non- 
identity automorphisms 7, œ, 8 with these actions. To construct 7, note that x? — 3 
is the minimal polynomial of both V3 and — V3 over Q. By Corollary 11.8, 
there is an isomorphism øg: Q( v3) = Q(-v3) 3) such that a(V3) = = - V3, and o 
fixes Q elementwise. Example 6 of Section 11.3 shows that x? — 5 is the mini- 
mal polynomial of V5 over Q(v3). By Corollary 11.8 again, g extends to 
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a Q-automorphism 7 of Q(V3)(V5) = Q(V3,V5) such that (V5) = V5. 
Therefore, T E GalgQ(V3,V5) and 7(V3) = o('V3) = — V3 and (V5) = V5. 
A similar two-step argument produces automorphisms @ and 8 with the actions 
listed above. Furthermore, each of T, a, B has order 2 in GalgQ(V3, V5); for 
instance, 


(r °1)(V3) = 1(r(V3)) = 1(-V3) = -1(-V3) = -(-V4) = V3 = (v3) 


and (T ° 1)(V5) = V5 = (V5). Therefore, T ° T = + by Theorem 12.4. 
Use Theorem 8.8 to conclude that GalgQ(V3,V5) = Z, X Z, or compute 
the operation table directly (Exercise 4). For instance, you can readily verify 
that (r © a)(V3) = g(V3) and (T ° a)(V5) = B(V5) and, hence, T ° a = B by 
Theorem 12.4. 


In the preceding example, Q(V3,V5) is the splitting field of (x)= 
(x? — 3)(x? — 5), and every automorphism in the Galois group permutes the four roots 
V3, —V3, V5, — V5 of f(x). This is an illustration of 


Corollary 12.5 


If K is the splitting field of a separable polynomial f(x) of degree n in Fix], then 
Gal,K is isomorphic to a subgroup of Sp. 


Proof» By separability f(x) has n distinct roots in K, say w,..., u,. Consider 
S, to be the group of permutations of the set R = {u,,...,u,}- If cE 
Gal pK, then oa(u), olu), . . ., O(z,) are roots of f(x) by Theorem 12.2. 
Furthermore, since øg is injective, they are all distinct and, hence, must be 
Uj, U2, ++.» Uy, in some order. In other words, the restriction of g to the 
set R (denoted g | R) is a permutation of R. Define a map 0:Gal,K > S, 
by 6(0) = a | R. Since the operation in both groups is composition of 
functions, it is easy to verify that 0 is a homomorphism of groups. 
K = F(u,..., tt) by the definition of splitting field. If ø | R = T| R, then 
alu) = T(u;) for every i, and, hence, g = r by Theorem 12.4. Therefore, 
8 is an injective homomorphism, and thus Gal zk is isomorphic to Im 6, a 
subgroup of S,, by Theorem 7.20. W 


If K is the splitting field of f(x), we shall usually 
identify Gal-X with its isomorphic subgroup in S, 


by identifying each automorphism with the permutation it induces on the roots of 
So. 


EXAMPLE 3.A 
Let K be the splitting field of x3 — 2 over Q. Verify that the roots of xX — 2 are 


W2, V2.0, Vw, where w = (-1 + V3i)/2 is a complex cube root of 1. Then 
GalgX is a subgroup of Sj. By Theorem 12.3, there is at least one automorphism 
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g that maps the first root W2 to the second N20; it must take the third root 
1/207 to itself or to the first root W/2 by Theorem 12.2. So øg is either the permu- 
tation (12) or (123) in 53. 


CAUTION: When Xisthe splitting field of a polynomial f(x) € Fid, then by 
Corollary 12.5 every element of Gal,K produces a permuta- 
tion of the roots of f(x), but not vice versa: A permutation 
of the roots need not come from an F-automorphism of K. 
For example, Q(V3,V5) is a splitting field of f(x) = 
Q? — 3X22 — 5), but by Example 2.A there is no Q-automorphism 
of Q(V3,-V'5) that gives this permutation of the roots 
V3 -V3 V5 -v5 
l b 4 4 
V5 -V5 v3 -v3 
Let K be an extension field of F. A field E such that Fe ES Kis called an interme- 
diate field of the extension. In this case, we can consider K as an extension of E. The 
Galois group GalzĶ consists of all automorphisms of K that fix E elementwise. Every 


such automorphism automatically fixes each element of F since F & E. Hence, every 
automorphism in Gal ;X is in Gal;K, that is, 


if E is an intermediate field, Gal,K is a subgroup of Gal,K. 


EXAMPLE 2.B 


Q(V3) is an intermediate field of the extension Q( V3, V5) of Q. Example 2.4 
shows that GalgQ(V3,V5) = fa, T, a, B}. The automorphisms that fix every 
element of Q( V3) are exactly the ones that map V3 to itself by Theorem 12.4. 


Therefore, 
Galan Q(V3,V5) 
is the subgroup {c, a} of {2, 7, a, B}. 


We now have a natural way of associating a subgroup of the Galois group with 
each intermediate field of the extension. Conversely, if H is a subgroup of the Galois 
group, we can associate an intermediate field with H by using 


Theorem 12.6 


Let K be an extension field of F. If H is a subgroup of Gal,X, let 
Ey = {k EK |o(k) = k for every o EH}. 
Then &,, is an intermediate field of the extension. 


The field Ey is called the fixed field of the subgroup H. 
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Proof of Theorem 12.6» 1f c, dE Ey and o € H, then 
olc + d)=a(C.) + a(d)=c+d and a(cd) = a()o(d)= ed. 
Therefore, Ey is closed under addition and multiplication. Since 
o(0;) = 0;and o(1,) = 1, for every automorphism, 07 and 1,are in Ey. 
Theorem 3.10 shows that for any nonzero c in Ey and any g in H, 
o(-e) = -o() = -c and a(c')=aQ' =e. 


Therefore, —c € Ey and o`t € Ep. Hence, Ey is a subfield of K. Since 
His a subgroup of Gal ;K, a(c) = c for every ce F and every g € H. 
Therefore, FS Ey. W 


EXAMPLE 2.C 


Consider the subgroup H = {2, a} of the Galois group {, T, a, B} of a(v3, v5) 
over Q. Since a(V3) = V3, the subfield Q(v3 ) is contained in the fixed field 
Ey of H. To prove that Ey = Q( V3), you must show that the elements of Q( V3) 
are the only ones that are fixed by + and a; see Exercise 14. 


EXAMPLE 1.C 


As we saw in Example 1.B, GalpC = {c, 0}, where g is the complex conjuga- 
tion map. Obviously, the fixed field of the identity subgroup is the entire field 
C. Since g fixes every real mmber and moves every nonreal one, the fixed field 
of GalgC is the field R. 


Unlike the situation in the preceding example, the ground field F need not always 
be the fixed field of the group Gal pK. 


EXAMPLE 3.B 


Every automorphism in the Galois group of aQ(v2) over Q must map ‘V2 to 
a root of x? — 2 by Theorem 12.2. Example 3.A shows that WZ is the only 
real root of this polynomial. Since Q(v2) consists entirely of real numbers 
by Theorem 11.7, every automorphism in GalgQ(V2) must map V2 to itself. 
Therefore, GalgQ(v/2) consists of the identity automorphism alone by 
Theorem 12.4. So the fixed field of GalgQ(/2) is the entire field Q(/2). 


E Exercises 


NOTE: Unless stated otherwise, K is an extension field of the field F. 
A. 1. If g is an Fautomorphism of K, show that ¢~? is also an F-automorphism of K. 


2. Assume [K:F] is finite. Is it true that every F-automorphism of Kis completely 
determined by its action on a basis of K over F? 
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. If [K:F] is finite, g € GalzK, and u E K is such that g (u) = u, show that 


o €GalrayK. 


. Write out the operation table for the group 


GalgQ(V3, V5) = {t T, æ, B}. 


[See Example 2.A.] 


. Let f(x) € F[x] be separable of degree n and K a splitting field of f(x). Show 


that the order of Gal,K divides »!. 


. If Kis an extension field of Q and g is an automorphism of K, prove that g is 


a Q-automorphism. [Hint: o(1) = 1 implies that g (n) = n for allnEZ.] 


. (a) Show that GalgQ(V2) has order 2 and, hence, is isomorphic to 2). 


[Hint: The minimal polynomial is x? — 2; see Theorem 11.7.) 
(b) If dEQ and Vd ¢Q, show that GalgQ( Vd) is isomorphic to Z, 


. Show that GalgQ|(~/2) + q). 
. (a) Let w = (-1 + V3i)/2 be a complex cube root of 1. Find the minimal 


polynomial p(x) of w over Q and show that œ? is also a root of p(x). 
[Hint: w is a root of x — 1] 


(b) What is GalgQ(w)? 


. (a) Find GalgQ(V2, V3). [Hint: See Example 2.A.] 


(b) If p, q are distinct positive primes, find GalgQ(Vp, V4). 


. Find GalgQ(V2, i). [Hint: Consider Q & Q (V2) c Q( V2, i) and proceed as 


in Example 2.A.] 


. Show that GalgQ(V2, V3, V5) = Z, X Z, X Zp. 
. If F has characteristic 0 and K is the splitting field of f(x) € F[x], prove that 


the order of Gal,K is [K:F]. [Hint: K = F(u) by Theorems 11.17 and 11.18.] 


. Let H be the subgroup {t, a} of GalgQ(V3, V5) = {ı, 7, œ, 8}. Show that 


the fixed field of H is Q( V3). [Hint: Verify that Q( V3) S Eps Q(V3, V5); 
what is [Q( V3, V5): (V3)]?] 


. (a) Show that every automorphism of R maps positive elements to positive 


elements. [Hint.: Every positive element of R is a square.] 
(b) If a, bER, a < b, and g € GalgR, prove that g(a) < a(b). 
[Hint: a < b if and only if b — a > 0 


(©) Prove that GalgR = (ON [Hint: If ce <r < d, with c, dE Q, then ¢ < a(r) < d, 
show that this implies g(r) = r.] 


. Suppose č, č, . . . , £7 = 1 are n distinct roots of x" — 1 in some extension field 


of Q. Prove that GalpQ(Q) is abelian. 


. Let E be an intermediate field that is normal over F and g € Gal;K. Prove that 


o(£) = E. 
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12.2 The Fundamental Theorem of Galois Theory 


The essential idea of Galois theory is to relate properties of an extension field with 
properties of its Galois group. The key to doing this is the Fundamental Theorem of 
Galois Theory, which will be proved in this section. 

Throughout this section, K is a finite-dimensional extension field of F. Let S be the 
set of all intermediate fields and T the set of all subgroups of the Galois group Gal;K. 
Define a function 9:S > T by this rule: 


For each intermediate field E, QE) = Gal ,K. 


The function ¢g is called the Galois correspondence. Note that K (considered as a 
subfield of itself ) corresponds to the identity subgroup of Gal;K, and the subfield F 
corresponds to the entire group Gal;K (considered as a subgroup of itself). 


EXAMPLE 2.D* 


Consider the Galois correspondence for the extension Q(V3,V5) of @ and the 


intermediate field Q(v3). By the preceding remarks and Example 2.B on 
page 412, we have 


Q(V3,V5) — Galas, v9Q(V3,V5) = {}. 
Q(-V3) — GalawsQ( V3, V5) = {4.a}. 
Q —> GalaQ( V3, V5) = {i, 7, æ, B}. 


Example 2.C shows that E = Q( V3) is the fixed field of the subgroup H = (+, a} = 


Galgy)Q( V3, V5). Furthermore, K = Q(V3, V5) = Q(V3)(V5) is a normal, 
separable extension of the fixed field E = Q(V3) because it’s the splitting field of 
x? — 5 (Theorem 11.15) and has characteristic 0 (Theorem 11.17). 


We now construct the tools necessary to show that, under appropriate assump- 
tions, the Galois correspondence is a bijective map from the set of intermediate fields 
to the set of subgroups of Gal,K. 


Lemma 12.7 


Let K be a finite-dimensional extension field of F. ff H is a subgroup of the 
Galois group Gal,K and E is the fixed field of H, then K is a simple, normal, 
separable extension of E. 


Example 2.D above (with K = Q(-V3,V5), E = Q( V3), and H = {, a}) is an 
illustration of Lemma 12.7. 


“The numbering scheme for examples in Sections 12.1 and 12.2 is explained on page 408. 
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Proof of Lemma 12.7» Each ue Kis algebraic over F by Theorem 11.9 and, hence, 
algebraic over E by Exercise 7 in Section 11.2. Every automorphism in 
H must map u to some root of its minimal polynomial p(x) € E[x] by 
Theorem 12.2. Therefore, u has a finite number of distinct images under 
automorphisms in H, say u = uw, t,..., U, EK. 
If ¢ € H and ų = 7(u) (with 7 € H), then o(u,) = o(t(u)). Since g ° TEH, 
we see that g{u;) is also an image of u and, hence, must be in the set 


{uis U2... , Us}. Since g is injective, the elements g(x), ..., o(u,) are t 
distinct images of u and, hence, must be the elements wu, w,..., u, in some 
order. In other words, every automorphism in H permutes uy, uz... , up Let 


f(x) = Œ — x — uy) ++ + — u). 

Since the u, are distinct, f(x) is separable. We claim that f(x) is actually 
in E[x]. To prove this, let g € H and recall that g induces an isomor- 
phism K[x] = K[x] (also denoted g), as described on page 380. Then 

af (x) = (x = a(u)Xx — olu) ` ` > (x — o(ud). 
Since g permutes the x, it simply rearranges the factors of f(x), and, hence, 
af(x) = f(x). Therefore, every automorphism of H maps the coefficients 
of the separable polynomial f(x) to themselves, and, hence, these coeffi- 
cients are in Æ, the fixed field of H. Since u = u, is a root of f(x) & E[x], 
u is separable over E. Hence, K is a separable extension of E. 

The field K is finitely generated over F (since [K:F] is finite; see 
Example 4 in Section 11.3). Consequently, K is finitely generated over E, 
and, hence, K = E(u) for some u E K by Theorem 11.18. Let f(x) be as in 
the preceding paragraph. Then f(x) splits in K[x], and, hence, K = Eu) 
is the splitting field of f(x) over E. Therefore, Kis normal over E by 
Theorem 11.15. E 


Theorem 12.8 


Let K be a finite-dimensional extension field of F. If H is a subgroup of the 
Galois group Ga/-K and E is the fixed field of H, then H = Ga/;K and |H| = 
[K:E]. Therefore, the Galois correspondence is surjective. 


Proof» Lemma 12.7 shows that K = E(u) for some u E K. If p(x), the minimal 
polynomial of u over E, has degree n, then [K:E] = n by Theorem 11.7. 
Distinct automorphisms of Gal;K map u onto distinct roots of p(x) by 
Theorems 12.2 and 12.4. So the number of distinct automorphisms in 
Gal,K is at most z, the number of roots of p(x). Now H & Gal; K by the 
definition of the fixed field E. Consequently, 


|H| = |Gal,K| <n = [K:E]. 


Let f(x) be as in the proof of Lemma 12.7. Then H contains at least t 
automorphisms (the number of distinct images of u under A). Since 
u = u is a root of f(x), p(x) divides f(x). Hence, 


|A| = t = deg f(x) = deg p(x) = n = [K:E]. 
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Combining these inequalities, we have 
|H| <|Gal,K| < [K:E] = |H]. 
Therefore, |H| = |Gal,K| = [K:E], and, hence, H = Gal,K. m 


EXAMPLE 3.C 


The Galois group GalgQ( V2) = () by Example 3.B, so both of the intermedi- 
ate fields Q(/2) and Q are associated with () under the Galois correspondence. 
Note that Q(v2) is not a normal extension of Q [it doesn’t contain the com- 
plex roots of x? — 2, so this polynomial has a root but doesn’t split in Q( v2). 


Galois Extensions 


Although the Galois correspondence is surjective by Theorem 12.8, the preceding 
example shows that it may not be injective. In order to guarantee injectivity, additional 
hypotheses on the extension are necessary. The preceding proofs and example suggest 
that normality and separability are lilæly candidates 


Definition If K is a finite-dimensional, normal, separable extension field of the field F, 


we say that K isa Galois extension of F or that K is Galois over F. 


A Galois extension of characteristic 0 is simply a splitting field by Theorems 11.15 
and 11.17. 


Theorem 12. 9 


Let K be a Galois extension ot F and E an intermediate field. Then E is the fixed 
field of the subgroup Gal-K. 


If E and L are intermediate fields with Gal;K = Gal,K, then Theorem 12.9 shows 
that both E and L are the fixed field of the same group, and, hence, E = L, Therefore, 
the Galois correspondence is injective for Galois extensions. 


Proof of Theorem 12.9» The fixed field E, of Gal,K contains E by definition. To show 
that Æ & E, we prove the contrapositive: If u ¢ E, then u is moved by some 
automorphism in Gal;K, and, hence, u ¢ Æo. Since K is a Galois extension 
of the intermediate field E (normal by Theorem 11.15 and Exercise 5 of 
Section 11.4; separable by Exercise 1 of Section 11.5), it is an algebraic 
extension of E. Consequently, u is algebraic over E with minimal polyno- 
mial p(x) € E[x] of degree = 2 (if deg p(x) = 1, then u would be in £). The 
roots of p(x) are distinct by separability, and all of them are in K by normal- 
ity. Let v be a root of p(x) other than u. Then there exists g € Gal,K such 
that g(u) = v by Theorem 12.3. Therefore, u ¢ Eo, and, hence, Ay = E. E 
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Corollary 12.10 


Let K be a finite-dimensional extension field of F. Then K is Galois over F if and 
only if F is the fixed field of the Galois group Gal;-X. 


Proof» tf K is Galois over F, then Theorem 12.9 (with E = F) shows that F is 
the fixed field of Gal;K. Conversely, if F is the fixed field of GalpK, then 
Lemma 12.7 (with E = F) shows that Kis Galois over F. m 


In view of Corollary 12.10, a Galois extension is often defined to be a finite- 
dimensional one in which F is the fixed field of Gal-K. When reading other books on 
Galois theory, it’s a good idea to check which definition is being used so that you don’t 
make unwarranted assumptions. 


EXAMPLE 2.E 


The field Q( V3, V5)i is a Galois extension of Q because it is the splitting 

field of f(x) = (x* — 3)? — 5). So the Galois correspondence is bijective by 
Theorem 12.8 and the remarks after Theorem 12.9. The Galois group 
GalgQ(V3,V5) = {e, T, a, P} by Example 2.A. Verify the accuracy of the 
chart below, in which subfields and subgroups in the same relative position cor- 
respond to each other under the Galois correspondence. For instance, a V3) 
corresponds to {z, œ} by Example 2.B. 


Intermediate Fields Subgroups 
Q43, 45) &) 


ee ) G Pot B) 


(, T, a, B) 


Note that all the intermediate fields are themselves Galois extensions of Q 
(for instance, Q(v5 ) is the splitting field of x? — 5). Furthermore, the corre- 
sponding subgroups of the Galois group are normal. A similar situation holds 
in the general case, as we now see. 


Theorem 12.11 The Fundamental Theorem of Galois Theory 


If K is a Galois extension field of F, then 


(1) There is bijection between the set S of all intermediate fields of the 
extension and the set 7 of all subgroups of the Galois group Gal-K, 
given by assigning each intermediate field E to the subgroup 
Gal;K. Furthermore, 


(K:E] = |Gal-K| and [E:F] = [Gal,K:Galgk]. 
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(2) An intermediate field £ is a normal extension of F if and only if the 
corresponding group Gal-K is a normal subgroup of Gal,K, and in 
this case Gal = Gal-K/Gal,K. 


Proof Theorem 12.8 and the remarks after Theorem 12.9 prove the first state- 
ment in part (1). Each intermediate field E is the fixed field of Gal;K 
by Theorem 12.9. Consequently, [K:E] = |Gal;K| by Theorem 12.8. In 
particular, if F = E, then [K:F'] = |Gal,K|. Therefore, by Lagrange’s 
Theorem 8.5 and Theorem 11.4, 


[K:E]LE:F] = [K:F] = |GalpK| = |GalsK| [GalpK:GalpK]. 


Dividing the first and last terms of this equation by [K:E] = |Gal,K| 
shows that 


[E:F] = [GalpK:Gal,K]. 


To prove part (2), assume first that Gal;K is a normal subgroup of 
Gal,XK. If p(x) is an irreducible polynomial in F [x] with a root u in E, we 
must show that p(x) splits in E[x]. Since K is normal over F, we know that 
P(x) splits in K[x]. So we need to show only that each root v of p(x) in K is 
actually in Æ. There is an automorphism g in Gal,K such that o(u) = v by 
Theorem 12.3. If 7 is any element of Gal;K, then normality implies 
T° =o ° Tı for some 7, E Gal, K. Since u € E, we have r(v) = t(a(u)) = 
o(7,(u)) = a(u) = v. Hence, v is fixed by every element 7 in Gal;K and, 
therefore, must be in the fixed field of Gal; K, namely E (see Theorem 12.9). 

Conversely, assume that £ is a normal extension of F. Then Æ is finite 
dimensional over F by part (1). By Lemma 12.12, which is proved below, 
there is a surjective homomorphism of groups @:Gal,K — Gal, whose ker- 
nel is Gal,K. Then Gal Kis anormal subgroup of Gal,;K by Theorem 8.16, 
and Gal,K/Gal;K = Gal, by the First Isomorphism Theorem 8.20. m 


EXAMPLE 3.D 


The splitting field K of x’ — 2 is a Galois extension of Q whose Galois group is 
a subgroup of S, by Example 3.A.* Note that Q c QW2) c K. Since x’ — 2 

is the minimal polynomial of ¥/2, [(Q(W2):Q] = 3 by Theorem 11.7. Neither 
of the other roots (W2w and V2?) i is a real number, and, hence, neither is in 
Q(W/2). So [K:Q] > 3. Since [K:Q] = 6 (Theorems 11.13, 11.14) and [K:Q] is 
divisible by 3 (Theorem 11.4), we must have [K:Q] = 6. Thus GalgK has order 
6 by Theorem 12.11 and is $}. 

The only proper subgroups of S, are the cyclic group ((123)) of order 3 
and three cyclic groups of order 2: (( 12) ), ((13)), ((23)). Verify that the Galois 
correspondence is as follows, where subgroups and subfields in the same rela- 
tive position correspond to each other. The integer by the line connecting two 


*We consider S; as the group of permutations of the roots ¥/2,~/2e, Y2w? in this order. For instance, 
(12) interchanges Y2 and W/2w and fixes Zw? 
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subfields is the dimension of the larger over the smaller. The integer by the line 
connecting two subgroups is the index of the smaller in the larger. 


Intermediate Fields 


Subgroups 
Mt 
se 
i 


¥ <(23)> <(13)> <(12)> 


<(123)> Wr 


The field Q(@) is an intermediate field because w = (FED) EK. 


Q(«) is the splitting field of x* + x + 1 (Exercise 3) and, hence, Galois over Q. 
The corresponding subgroup is the normal subgroup ((123)). On the other 
hand, Example 3.C shows that Q(v2) is not Galois over Q; the corresponding 
subgroup ((23)) is not normal in S3. 


The preceding example illustrates an important fact: 
The Galois correspondence is inclusion-reversing. 


For instance, Q S Q(w), but the corresponding subgroups satisfy the reverse inclusion: 
S32 ((123)}. 
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Finally, we complete the proof of the Fundamental Theorem by proving 


Lemma 12,12 


Let K be a finite-dimensional norma! extension field of F and £ an intermedi- 
ate field, which is normal over F. Then there is a surjective homomorphism 
of groups @:Gal,K — Gal,£ whose kernel is Gal-K. 


Proof Let oe Gal,K and u E E. Then u is algebraic over F with minimal 
polynomial p(x). Since E is a normal extension of F, p(x) splits in Æ{x], 
that is, all the roots of p(x) are in E. Since o(u) must be some root of 
p(x) by Theorem 12.2, we see that o(u) E E. Therefore, o (E) = E for 
every g € Gal,K. Thus the restriction of g to E (denoted a | £) is an 
F-isomorphism E = g(E). Hence, [E:F] = [o(E):F] by Theorem 11.5. 
Since FS ø (E) & E, we have [E:F] = [E:o(E)][o(£):F] by Theorem 11.4, 
which forces [E:o(£)] = 1. Therefore, E = o(£), and g |E is actually an 
automorphism in Gal p£. 

Define a function 0:Gal K — GalzE by 6(¢) = g | E. It is easy to 
verify that 0 is a homomorphism of groups. Its kernel consists of the au- 
tomorphisms of K whose restriction to Æ is the identity map, that is, the 
subgroup Gal;K. 

To show that 6 is surjective, note that K is a splitting field over F 
by Theorem 11.15, and, hence, K is a splitting field of the same poly- 
nomial over E. Consequently, every T € Gal;E can be extended to an 
F-automorphism g in Gal;K by Theorem 11.14. This means that 
g |E = r, that is, 0(7) = 7. Therefore, 6 is surjective. W 


In the preceding proof, the normality of K was not used until the last paragraph. 
So the first paragraph proves this useful fact: 


Corollary 12.13 


Let K be an extension field of £ and £ an intermediate field that is normal over 
F. f o € Gal;K, then ø |EeGal£. 


@ Exercises 


NOTE: K is an extension field of the field F. 
A. 1. If Kis Galois over F, show that there are only finitely many intermediate fields. 
2. If Kis anormal extension of Q and [K:Q] = p, with p prime, show that 
GaloK = Z, 
3. (a) Show that w = (—1 + V3%)/2 is a root of x? — 1. 


(b) Show that w and w? are roots of x? + x + 1. Hence, Q(w) is the splitting 
field of x7 +x +1. 
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4. Exhibit the Galois correspondence of intermediate fields and subgroups for 
the given extension of Q: 


(a) Q(V@), where de Q, but Vd ¢ Q. 
(b) Q(w), where w is as in Exercise 3. 


5. If Kis Galois over F and Gal;K is an abelian group of order 10, how many 
intermediate fields does the extension have and what are their dimensions 
over F? 


6. Give an example of extension fields K and L of Fsuch that both K and L are 
Galois over F, K + L, and Gal,K = Gal;L. 


B. 7. Exhibit the Galois correspondence for the given extension of Q: 
(a) Q(V2,V3) œ) Ofi,V2) 


8. If K is Galois over F, Gal;K is abelian, and E is an intermediate field that is 
normal over F, prove that Gal;K and Gal;E are abelian. 


9. Let K be Galois over F and assume Gal;K = Z,,. 


(a) If E is an intermediate field that is normal over F, prove that Gal;K and 
Gal;£ are cyclic. 


(b) Show that there is exactly one intermediate field for each positive divisor 
of n and that these are the only intermediate fields. 


10. Two intermediate fields E and L are said to be conjugate if there exists 
g €Gal;K such that o(£) = L. Prove that E and L are conjugate if and 
only if GalzK and Gal,X are conjugate subgroups of Gal;K (as defined on 
page 308). 


11. (a) Show that K = Qv, i) is a splitting field of x* — 2 over Q. 
(b) Prove that [K:Q] = 8 and conclude from Theorem 12.11 that GalgK has 
order 8. [Hint: Q c Q(V2)< Q(V2, i).] 


(c) Prove that there exists @ E€ GalgK such that o(W/2)= (2) anda(i) =i 
and that g has order 4. 


(d) By Corollary 12.13 restriction of the complex conjugation map to K is an 
element 7 of GalgK. Show that 


GaloK = {a, 0”, 0°, o* = 4, T, OT, O°T, OPT}. 
[Hint: Use Theorem 12.4 to show these elements are distinct.) 
(e) Prove that GalgK = D,. [Hint: Mapa to r to 7 to v.] 
12. Let K be as in Exercise 11. Prove that GalggyK = Z4. 


C. 13. Let K be as in Exercise 11, Exhibit the Galois correspondence for this extension. 
[Among the intermediate fields are Q((1 + i) VZ) and Q((1 — i) ¥2).] 


14, Exhibit the Galois correspondence for the extension Q( V2, V3, V5) of Q. 
[The Galois group has seven subgroups of order 2 and seven of order 4.] 
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123 Solvability by Radicals 


The solutions of the quadratic equation ax? + bx + ¢ = 0 are given by the well-known 


formula 
—b + VBP — 4ac 
2a i 
This fact was known in ancient times. In the sixteenth century, formulas for the solu- 
tion of cubic and quartic equations were discovered. For instance, the solutions of 
x' + bx + c = 0 are given by 


x = W(-e/2) + Vd + X (—c/2) — Vd 
x = o(W(-c/2) + Vd) + oÝ (—c¢/2) — Vd) 
x = w{W(—c/2) + Vd) + w( WY (—c/2) — Vd), 


where d = (b°/27) + (c?/4), w = (Fl + V3i)/2 is a complex cube root of 1, and the 
other cube roots are chosen so that 


(W(—e/2) + Vay (-e/2) — Va) = —b/3.* 

In the early 1800s Ruffini and Abel independently proved that, for n = 5, there 
is no formula for solving all equations of degree n. But the complete analysis of the 
problem is due to Galois, who provided a criterion for determining which polynomial 
equations are solvable by formula. This criterion, which is presented here, will enable 
us to exhibit a fifth-degree polynomial equation that cannot be solved by a formula. To 
simplify the discussion, we shall assume that all fields have characteristic 0. 

As illustrated above, a “formula” is a specific procedure that starts with the coefficients 
of the polynomial f(x) € F[x] and arrives at the solutions of the equation f(x) = 0p by 
using only the field operations (addition, subtraction, multiplication, division) and the 
extraction of roots (square roots, cube roots, fourth roots, etc.). In this context, an ath 
root of an element c in F is any root of the polynomial x* — c in some extension field of F. 

If f(x) & F[x], then performing field operations does not get you out of the coef- 
ficient field F (closure!). But taking an nth root may land you in an extension field. 
Taking an mth root after that may move you up to still another extension field. Thus 
the existence of a formula for the solutions of f(x) = 0; implies that these solutions lie 
in a special kind of extension field of F. 


x= 


EXAMPLE 1 


Applying the cubic formula above to the polynomial x? + 3x + 2 shows that 
the solutions of x + 3x + 2 = 0 are 


V-I +V + W-1- V2, 
oW-1 + V2 + (@)W-1 - v2, 
(WDY -1 + V2 4+ aW-1-— V2. 


*The formulas for the general cubic and the quartic are similar but more complicated. 
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All these solutions lie in the extension chain: 
QSQ(w) SQ(w, V2) Q(o, V2, W-1 +-V2) < (0, V2, V-1 + V2, Y-1- V2) 
lll I I | 


her Sake Jc F; = E, 
Each field in this chain is a simple extension of the preceding oneand is of the form F{u), 
where WER for somen (that is, u is an nth root of some element of F)): 

F= Rw),  wherew? = 1EF, 

F,=F,(V2),  where(V2) =2eH°F,. 

= F,(W-1 + v2), where (W-1+ V2) =-1+ VER. 
F,=F,(W-1— V2), where (W/-1 - V2} = -1- VERS A, 

Since F, contains all the solutions of x* + 3x + 2 = 0, it also contains a splitting 
field of x? + 3x + 2. 


The preceding example is an illustration of the next definition. 


Definition A field K is said to be a radical extension of a field F if there is a chain of 
fields 


FHh, Sh of, o---ch=K 
such that for each / = 1,2,...,% 


F, = F(u) and some power of u; is in Ey 


Let f(x) € Fix]. The equation f(x) = Opis said to be solvable by radicals if there is a 
radical extension of F that contains a splitting field of f(x). The example above shows 
that x? + 3x + 2 = 0 is solvable by radicals. 

The preceding discussion shows that if there is a formula for its solutions, then the 
equation f(x) = Opis solvable by radicals. Contrapositively, if f(x) = 0; is not solvable 
by radical, then there cannot be a formula (in the sense discussed above) for finding its 
solutions. 


Solvable Groups 


Before stating Galois’ Criterion for an equation to be solvable by radicals, we need to intro- 
duce a new class of groups. A group G is said to be solvable if it has a chain of subgroups 


G = G@2G,2G,2°-+°26,_,2G, = (e) 
such that each G; is a normal subgroup of the preceding group G;_, and the quotient 
group G,_,/G; is abelian. 
EXAMPLE 2 


Every abelian group Gis solvable because every quotient group of G is abelian, 
so the sequence G2 (e) fulfills the conditions in the definition. 
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EXAMPLE 3 


Let ((123)) be the cyclic subgroup of order 3 in S3. The chain S32 ((123)) 2 ()) 
shows that S; is solvable. But for other symmetric groups we have 


Theorem 12,14 


For n = 5 the group S, is not solvable. 


Proof» Suppose, on the contrary, that S, is solvable and that 
S, = G2 G,2G,2°+-2G,= (1) 


is the chain of subgroups required by the definition. Let (rst) be any 
3-cycle in S, and let u, v be any elements of {1, 2, . . . , n} other than 

r, s, t (u and v exist because n = 5). Since S,/G, is abelian, Theorem 8.14 
(with a = (tus), b = (srv)) shows that G, must contain 


(tus)(srv)(tus)—'(sru)"! = (tus)(srv)(tsu)(sur) = (rst). 


Therefore, G, contains all the 3-cycles. Since G,/G» is abelian, we can 
repeat the argument with G; in place of S,, and G, in place of G, and 
conclude that Gz contains all the 3-cycles. The fact that each G;_;/G; is 
abelian and continued repetition lead to the conclusion that the iden- 
tity subgroup G, contains all the 3-cycles, which is a contradiction. 
Therefore, S,is not solvable. @ 


Theorem 12.15 


Every homomorphic image of a solvable group G is solvable. 


Proof» Suppose that f:G — H is a surjective homomorphism and that G = 
G22 Gh 2G= (ea) is the chain of subgroups in the defini- 
tion of solvability. For each i, let H, = f (G,) and consider this chain of 
subgroups: 


H = Hy) 2H, 2 H2: ° + 2 H, = f(eo)) = (ex) 
Exercise 22 of Section 8.2 shows that H; is a normal subgroup of H;-; 
for each ¿ = 1l, 2, . . . , ¢. Let a, b € H,-1. Then there exist ¢, dE G,_, such 
that f(c) = aand f(d) = b. Since G,_,/G;,is abelian by solvability, 
cedc~'d—! € G, by Theorem 8.14. Consequently, 
aba'b = f(A f(a) f(e f (d7) = f (edd) Ef (G) = H, 
Therefore, H,_,/H,is abelian by Theorem 8.14, and H is solvable. m 
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Galois’ Criterion 


If f(x) € F[x], then the Galois group of the polynomial f(x) is Gal-K, where K is a splitting 
field of f(x) over F.* Galois’ Criterion states that 


f(x) = Or is solvable by radicals if and only if the Galois 
group of f (x) is a solvable group. 


In order to prove Galois’ solvability criterion, we need more information about 
radical extensions and nth roots. If F is a field and ¢ is a root of x* — 1p in some 
extension field of F (so that {* = 1,), then ¢ is called an ath root of unity. The deriva- 
tive nx"! of x”? — 1pis nonzero (since F has characteristic 0) and relatively prime to 
x" — 1p. Therefore, x" — 1pis separable by Lemma 11.16. So there are exactly n distinct 
nth roots of unity in any splitting field K of x" — 1, If ¢ and 7 are nth roots of unity 
in K, then 


(D= Tr = Ip ly = 1p 


so that {7 is also an nth root of unity. Since the set of nth roots of unity is closed under 
multiplication, it is a subgroup of order n of the multiplicative group of the field K 
(Theorem 7.12) and is, therefore, cyclic by Theorem 7.16 or Corollary 9.11. A genera- 
tor of this cyclic group of nth roots of unity in Kis called a primitive nth root of unity. 
Thus ¢ is a primitive nth root of unity if and only if Z, (7, ©, . . . , {" = lpare then 
distinct nth roots of unity. 


EXAMPLE 4 


The fourth roots of unity in C are 1, —1, i ~i. Since Ë = —1,? = —é, and #* = 1, 
iis a primitive fourth root of unity. Similarly, —i is also a primitive fourth root of 
unity. DeMoivre’s Theorem shows that for any positive n, 


cos(27/n) + i sin(27/n) is a primitive nth root of unity in C. 
When n = 3, this states that 
w = cos(27/3) + i sin(27/3) = (-1/2) + (V3/2)i 


is a primitive cube root of unity. 


Lemma 12.16 


Let F be a field and ¢ a primitive nth root of unity in F. Then F contains a 
primitive dth root of unity for every positive divisor d of n. 


Proof» By hypothesis ¢ has order n in the multiplicative group of F. If n = dt, 
then ¢‘ has order d by Theorem 7.9. So ¢* generates a subgroup of order 
d, each of whose elements must have order dividing d by Corollary 8.6. 
In other words, ((¢94)4 = 1p for every k. Thus the d distinct powers £*, 


*Since any two splitting fields of f(x) are isomorphic by Theorem 11.14, it follows that the corre- 
sponding Galois groups are isomorphic. So the Galois group of f(x) is independent of the choice of K. 
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(“y,...,(f)d3, (f = 1pare roots of xf — 1p Since xf — 1phas at most 
droots and every dth root of unity is a root of xf — 1, ‘is a primitive dth 
root of unity. E 


We can now tie together the preceding themes and prove two theorems that are 
special cases of Galois’ Criterion as well as essential tools for proving the general case. 


Theorem 12.17 


Let F be a field of characteristic 0 and ¢ a primitive nth root of unity in some 


extension field of F. Then K = F(£) is a normal extension of F, and Gal;K is 
abelian. 


Proof» The field K = F(é) contains all the powers of £ and is, therefore, a split- 
ting field of x” — 1p.* Hence, K is normal over F by Theorem 11.15. 
Every automorphism in the Galois group must map ¢ onto a root of 
x” — 1p by Theorem 12.2. So if g, 7 € GalpK, then a(Q) = c* and 
7(£) = £ for some positive integers k, t. Consequently, 


(0 ° E) = oE) = of) = o GY = CY = &. 
(ro XO = NEY = re) = EF = CY = g". 
Therefore, g ° 7 = T ° g by Theorem 12.4, and GalpK is abelian. E 


Theorem 12,18 


Let F be a field of characteristic 0 that contains a primitive nth root of unity. 
If vis a root of x" — ce Hx] in some extension field of F, then K = F(u) isa 
normal extension of F, and Gal,;K is abelian. 


Proof t> By hypothesis, u" = c. If ¢ is a primitive nth root of unity in F, then for 
any k, 


C = (e = CH = Ire = 


Consequently, since ¿, Z7,..., ¿œ = 1 pare distinct elements of F, the ele- 
ments fu, Ż’u, Ču, . . . , ("u = ware the n distinct roots of x" — c. Hence, 
K = F(u) is a splitting field of x — c over F and is, therefore, normal 
over F by Theorem 11.15 If ø, 7, € Gal pX, then o (u) = f*u and 7(u) = 
t'u for some k, t by Theorem 12.2. Consequently, since ¿* and č“ are in F, 


*The field K = F(Z) is a radical extension of F since {” = 1p. Thus x° — 1¢ = 0z is solvable by radicals. 
So the theorem, which says that Gal;K (the Galois group of x" — 1;), is abelian (and hence, solvable), 
is a special case of Galois’ Criterion. 

tFor an alternate proof showing that Gal; is actually cyclic, see Exercise 22. 


The field K = F(u) is also a radical extension of F since u” = ceF, so x" — c = 0, is solvable by 
radicals. Hence, the theorem is another special case of Galois’ Criterion. 
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(5 © Tu) = o(r(u)) = oltu) = (iolu) = tu) = fu. 
(T ° ou) = Talu) = Tiu) = (rlu) = Flu) = ote. 
Therefore, g ° T = T ° g by Theorem 12.4, and Gal;K is abelian. E 


Theorem 12.19 Galois’ Criterion 


Let F be a field of characteristic 0 and f(x) eF{x]. Then f(x) = 0, is salvable by 
radicals if and only if the Galois group of f{x) is solvable. 


We shall prove only the half of the theorem that is needed below; see Section V.9 of 
Hungerford [5] for the other half. 


Proof of Theorem 12.19» Assume that f(x) = Opis solvable by radicals. The proof, 
whose details are on pages 429-431, is in three steps: 


1. Theorem 12.21: There is a normal radical extension K of F that con- 
tains a splitting field £ of f(x).* 
2. The field Eis normal over F by Theorem 11.15. 


3. Theorem 12.22: Any intermediate field of K that is normal over F has 
a solvable Galois group; in particular, Gal p£ (the Galois group of 
f(x)) is solvable. m 


Before completing the proof of Theorem 12.19, we use it to demonstrate the insol- 
vability of the quintic. 


EXAMPLE 5 


We claim that the Galois group of the polynomial f(x) = 2x5 — 10x + 5€ Q[x] 
is S;, which is not solvable by Theorem 12.14. Consequently, the equation 

2x5 — 10x + 5 = 0 is not solvable by radicals by Theorem 12.19. So, as 
explained on page 424, 


there is no formula (involving only field operations and 
extraction of roots) for the solution of all fifth-degree 
polynomial equations. 


To prove our claim, note that the derivative of f(x) is 10x* — 10, whose only 
real roots are +1 (the others being +i). Then f(x) = 402°, and the second- 
derivative test of elementary calculus shows that f(x) has exactly one relative 
maximum at x = —1, one relative minimum at x = 1, and one point of inflec- 
tion at x = 0. Soits graph must have the general shape shown on the next page. 
In particular, f(x) has exactly three real roots. 


*This is a crucial technical detail. The definition of solvability by radicals guarantees only a radical 
extension of F containing E. But a radical extension need not be normal over F (Exercise 19), and if 
it is not, the Fundamental Theorem 12.11 can't be used. 
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Note that f(x) is irreducible in Q[x] by Hisenstein’s Criterion (with p = 5). If K 
is a splitting field of f(x) in C, then GalgK has order [K:Q] by the Fundamental 
Theorem. If r is any root of f(x), then [K:Q] = [K:Q(r)] [Q(r):Q] by Theorem 11.4 
and [Q(r):Q] = 5 by Theorem 11.7. So the order of GalgK is divisible by 5. It 
follows that GalgK contains an element of order 5.* 

The group GalgK, considered as a group of permutations of the roots of 
f(x), is a subgroup of S; (Corollary 12.5). But the only elements of order 5 in 
S; are the 5-cycles (see Exercise 19 in Section 7.5). So GalgX contains a 5-cycle. 
Complex conjugation induces an automorphism on K (Corollary 12.13). This 
automorphism interchanges the two nonreal roots of f(x) and fixes the three 
real ones. Thus GalgX contains a transposition. Exercise 8 shows that the only 
subgroup of S; that contains both a 5-cycle and a transposition is S; itself. 
Therefore, Galo = S; as claimed. 


We now complete the proof of Galois’ Criterion, beginning with a technical lemma 
whose import will become clear in the next theorem. 


Lemma 12.20 


Let F, E, Ł be fields of characteristic 0 with 
FOECL=E(v) and ve. 


If L is finite dimensional over F and E is normal over F, then there exists 
an extension field M of L, which is a radical extension of E and a normal 
extension of F. 


Proof» By Theorem 11.15, E is the splitting field over F of some g(x) € F[x]. 
Let p(x) E F[x] be the minimal polynomial of v over Fand let M bea 
splitting field of g(x) p(x) over F. Then M is normal over F by Theorem 11.15. 
Furthermore, FS ES Le M (since L = E(v) and E is generated over 
F by the roots of g(x)). Let v = v, va, ..., v, be all the roots of 
P(x) in M. For each i there exists gE Gal;M such that g, (v) = v, by 


“If you have read Chapter 9 use Corollary9.14; otherwise, use Exercise9 in this section. 
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Theorem 12.3. Corollary 12.13 shows that g,(E) S E. By hypothesis, o* = 
bE E; so for each i, 


(w = axe} = o) = odb) CES Buy... , 1-1). 
Consequently, 


ES L= E(u) & E(u, v) SE (vy vq, vy) SS E (vgs v2, oe o o V) = M 
is a radical extension of E. m 


Theorem 12.21 


Let F be a field of characteristic 0 and f(x) €F[x]. If f(x) = Op is solvable by 
radicals, then there is a normal radical extension field of F that contains 
a splitting field of f(x). 


Proof» By definition some splitting field K of f(x) is contained in a radical 
extension 


FH=RcCrhCPCHhe:: -CF,, 


where F, = F, (w) and (w)™is in F, for each i= 1,2,..., t. Applying 
Lemma 12.20 with E = F, L = F,, and v = u; produces a normal radical 
extension field M, of F that contains F}. By hypothesis (u,)"e F, S M}. 
Applying Lemma 12.20 with E = M}, v = u, and L = M,(u,) produces 
a normal extension field M, of F that is a radical extension of M, and, 
hence, a radical extension of F. Furthermore, M, contains F, = F,(u,). 


Continued repetition of this argument leads to a normal radical exten- 
sion field M, of F that contains F, and, hence, contains K. m 


Theorem 12.22 


Let K be a normal radical extension field of F and £ an intermediate field, all 
of characteristic 0. If E is normal over F, then Ga/,£ is a solvable group. 


Proof By hypothesis there is a chain of subfields 
F=KSCF SRBC: -aF =K, 


where F, = F,_,(u,) and (;)" is in F,_, for each i = 1,2,..., £. Let n be 
the least common multiple of n4, nz, ..., n, and let £ bea are nth 
root of unity. For each i = 0, let E, = FÖ). Then for each i = 1 


= F O = Fuso) = Fils 2) = FEA) = Elu). 
Since (u,)"EF,_, S E; for i = land {" EF, 


FOESE,CE,CES:*+-SE,=L 
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is a radical extension of F that contains K (and, hence, £).* The normal 
extension K = F, is the splitting field of some polynomial p(x) € F[x] 
by Theorem 11.15, and, hence, L = E, = F,(£) is the splitting field of 
pP(x)(X" — 1p) over F. Therefore, L is Galois over F by Theorems 11.15 
and 11.17. 

Consider the following chain of subgroups of GalpL: 


Gal;L pe) Galz,L z Galz,L 2 Galz,L 2e Galz_,L = Gal, L = (e). 


We shall show that each subgroup is normal in the preceding one and 
that each quotient is abelian. Since each x, divides x, Ey contains a primi- 
tive nth root of unity by Lemma 12.16. Consequently, by Theorem 12.18 
each E, (with i = 1) is anormal extension of £;_,, and the Galois group 
Gal,_,E,is abelian. Since L is Galois over F, it is Galois over every Ej. 
Applying the Fundamental Theorem 12.11 to the extension L of E1, we 
see that Gal is a normal subgroup of Gal, L and that the quotient 
group Gal, _,L/GalzL is isomorphic to the abelian group Galy _£,. 
Similarly by Theorems 12.11 and 12.17, Eo is normal over F, Gal, L is 
normal in GalsL, and GalpL/GalgL is isomorphic to the abelian group 
Gal Eo. Therefore, Gal ,L is a solvable group. 

Since Eis normal over F, the Fundamental Theorem shows that 
GalgŻL is normal in GalL and Gal; /GalgL is isomorphic to Gal E. 
So Gal;F is the homomorphic image of the solvable group GalL 
(see Theorem 8.18) and is, therefore, solvable by Theorem 12.15. E 


E Exercises 


NOTE: F denotes a field, and all fields have characteristic 0. 
A. 1. Find a radical extension of Q containing the given number: 


(a) Y1 + V7 - W2 + V5 
b) (W/V72 + (W5) 
(c) (Y3 - V2)/(4 + v2) 


2. Show that x? — 3 and x? — 2x — 2€ Q[x] have the same Galois group. 
[Hint: What is the splitting field of each?] 


3. If Kis a radical extension of F, prove that [K:F] is finite. 
[Hint: Theorems 11.7 and 11.4.] 


*The construction of L does not use the hypothesis that K is normal over F, and, as we shall see 
below, every field in the chain is a normal extension of the immediately preceding one. But this is not 
enough to guarantee that L is normal (hence Galois) over F (Exercise 19). We need the hypothesis 
that K is normal over F to guarantee this, so that we can use the Fundamental Theorem on L. 
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4. Prove that for n = 5, A, is not solvable. [Hint: Adapt the proof of 
Theorem 12.14.] 


5. (a) Show that S, is a solvable group. [Hint: Consider the subgroup H = 
{(12)(34), (13)(24), (14)(23), (1)} of Ay] 
(b) Show that D, is a solvable group. 


6. If Gis a simple nonabelian group, prove that G is not solvable. [This fact and 
Theorem 8.26 provide another proof that A, is not solvable for n = 5.] 


7. List all the nth roots of unity in C when n = 
(a) 2 (b) 3 (c) 4 (d) 5 (e) 6 
B. 8. Let G be a subgroup of S; that contains a transposition g = (rs) and a 5-cycle a. 
Prove that G = S; as follows. 


(a) Show that for some k, a* is of the form (rsxyz). Let T = a* € G; by 
relabeling we may assume that g = (12) and 7 = (12345). 


(b) Show that (12), (23), (34), (45) € G. [Hint: Consider 7*or~* for k = 1]. 
(c) Show that (13), (14), (15) € G. [Hint: (12)(23)(12) = 7] 
(d) Show that every transposition is in G. Therefore, G = S; by Theorem 7.26. 


9. Let G be a group of order n. If 5|n, prove that G contains an element of order 
5 as follows. Let S be the set of all ordered 5-tuples (r, s, t, u, v) with r, s, t, u, 
vE Gand rstuv = e. 


(a) Show that S contains exactly n* 5-tuples. [Hint: If r, s, t, u, € G and v = 
(rstu)—', then (r, s, t, u, v) E S.] 


(b) Two 5-tuples in S are said to be equivalent if one is a cyclic permutation of 
the other.* Prove that this relation is an equivalencerelation on S. 


(c) Prove that an equivalence class in S either has exactly five 5-tuples in it or 
consists of a single 5-tuple of the form (r,r, r, r, r). 


(d) Prove that there are at least two equivalence classes in S that contain 
a single 5-tuple. [Hint: One is {(e, e, e, e, e)}. If this is the only one, 
show that n‘ = 1 (mod 5). But 5 |», so n° = 0 (mod 5), whichis a 
contradiction.] 


(e) If {(c, c, c, c, c)}, with c # e, is a single-element equivalence class, prove 
that ¢ has order 5. 


10. If N is a normal subgroup of G, N is solvable, and G/N is solvable, prove that 
Gis solvable. 


11. Prove that a subgroup H of a solvable group G is solvable. [Hint If G = G2 
G, 2° * + 2 G, = (e) is the solvable series for G, consider the groups H; = H A G, 
To show that H,-1/H; is abelian, verify that the map H,_,/H;—> Gi-1/G; given 
by H,x — G,x is a well-defined injective homomorphism.] 


“For instance, (r, S, £ u, v) is equivalent to each of (s, t, u, v, F), (2, u, ¥, f, S), (U, Y, T, S, £), (V, f, S, t, u), 
(7, s, & u, y) and to no other 5-tuples in S. 
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12. Prove that the Galois group of an irreducible quadratic polynomial is 
isomorphic to Z}. 


13. Prove that the Galois group of an irreducible cubic polynomial is isomorphic 
to Z, or S4. 

14. Prove that the Galois group of an irreducible quartic polynomial is solvable. 
[Hint: Corollary 12.5 and Exercises 5 and 11.] 


15. Let p(x), g(x) be irreducible quadratics. Prove thatthe Galois group of f(x) = 
P(x)g(x) is isomorphic to #, X Z, or Z2. [Hint: If u is a root of p(x) and va 
root of g(x), then there are two cases: v ¢ F(u) and ve F(u).] 


16. Use Galois’ Criterion to prove that every polynomial of degree = 4 is solvable 
by radicals. [Hint: Exercises 12-15.] 


17. Find the Galois group G of the given polynomial in Q[x]: 
(a) xX — 4x° + 4 [Hint: Factor] 
(b) x* — 5x7 + 6 
(c) Æ + 6x° + 9x 
(d) x* + 3x3 ~ 2x — 6 
(e) x — 10x — 5 [Hint: See Example 5] 
18. Determine whether the given equation over Q is solvable by radicals: 
(a) S+224+1=0 b) 3x° ~ 15x +5=0 
(c) 2x5 -— 5x4 +5 =0 (d) x5 — x* — 16x + 16=0 
19. (a) Prove that Q(V2i) is normal over Q by showing it is the splitting field of 
x + 2, 
(b) Prove that a(v2(1 =A )) is normal over Q(V2i) by showing that it is the 
splitting field of x? + 2V/2i. 


(c) Show that Q & Q(V2i) S Q(v2(1 — i)) is a radical extension of Q with 
[QYZ 1 — i)):Q] = 4 and note that Q contains all second roots of unity 
(namely + 1). 

(d) Let L = Q(W/2(1 — i)). Show that v = Y2(1 + i) is not in L. 

[Hint: If ve Land u = 9/2(1 — i) € L, show that v/u = iand (v — u)/2i = 
V2 L, which implies that [L:Q] = Qvz, i):Q], contradicting (c) and 
Exercise 12(b) in Section 12.2.] 

(e) Prove that L = Q(V2(1 — i)) is not normal over Q [Hint: u and v (as in 
(d)) are roots of the irreducible polynomial x* + 8.] 

20. Let ¢ be a primitive fifth root of unity. Assume Exercise 2 1 in Section 4.5 and 

prove that GalgQ(¿), the Galois group of x‘ — 1, is cyclic of order 4. 


21. What is the Galois group of x* + 32 over Q? [Hint: Show that Q(¢) is a 
splitting field, where ¢ is a primitive fifth root of unity; see Exercise 20.] 


22. Prove that the group Gal,K in Theorem 12.18 is cyclic. [Hint: Define a map 
f from GalpK to the additive group Z, by f(a) = k, where a (u) = ¢*u. Show 
that f is a well-defined injective homomorphism and use Theorem 7.17.] 
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C.23. If pis prime and G is a subgroup of S, that contains a transposition and a 
p-cycle, prove that G = S,. [Exercise 8 is the case p = 5.] 


24. If f(x) e Q[x] is irreducible of prime degree p and f(x) has exactly two 
nonreal roots, prove that the Galois group of f(x) is S,. [Example 5 is 
essentially the case p = 5.] 


25. Construct a polynomial in Q[x] of degree 7 whose Galois group is S}. 
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PART 3 


EXCURSIONS AND APPLICATIONS 
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CHAPTER ]Q 


Public-Key Cryptography 


Prerequisites: Section 2.3 


Codes have been used for centuries by merchants, spies, armies, and diplomats to trans- 
mit secret messages. In recent times, the large volume of sensitive matenal in government 
and corporate computerized data banks (much of which is transmitted by satellite or 
over telephone lines) has increased the need for efficient, high-security codes. 

It is easy to construct unbreakable codes for one-time use. Consider this “code pad”: 


Actual Word: morning evening Monday Tuesday attack 
Code Word: bat gixt ling button figle 


If I send you the message FIGLE BUTTON BAT, there is no way an enemy can know 
for certain that it means “attack on Tuesday morning” unless he or she has a copy of 
the pad. Of course, if the same code is used again, the enemy might well be able to 
break it by analyzing the events that occur after each message. 

Although one-time code pads are unbreakable, they are cumbersome and inef- 
ficient when many long messages must be routinely sent. Even if the encoding and 
decoding are done by a computer, it is still necessary to design and supply a new pad 
(at least as long as the message) to each participant for every message and to make all 
copies of these pads secure from unauthorized persons. This is expensive and imprac- 
tical when hundreds of thousands of words must be encoded and decoded every day. 

For frequent computer-based communication among several parties, the ideal code 
system would be one in which 


1. Each person has efficient, reusable, computer algorithms for encoding and 
decoding messages. 
2. Each person’s decoding algorithm is not obtainable from his or her encoding 
algorithm in any reasonable amount of time. 
437 
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A code system with these properties is called a public-key system. Although it may not 
beclear how condition 2 could be satisfied, it is easy to see the advantages of a public- 
key system. 

The encoding algorithm of each participant could be publicly announced—perhaps 
published in a book (like a telephone directory)—thus eliminating the need for couriers 
and the security problems associated with the distribution of code pads. This would not 
compromise secrecy because of condition 2: Knowing a person’s encoding algorithm 
would not enable you to determine his or her decoding algorithm. So you would have no 
way of decoding messages sent to another person in his or her code, even though you 
could send coded messages to that person. 

Since the encoding algorithms for a public-key system are available to everyone, forgery 
appears to be a possibility. Suppose, for example, that a bank receives a coded message 
claiming to be from Anne and requesting the bank to transfer money from Anne’s account 
into Tom’s account. How can the bank be sure the message was actually sent by Anne? 

The answer is as simple as it is foolproof. Coding and decoding algorithms are in- 
verses of each other: Applying one after the other (in either order) produces the word 
you started with. So Anne first uses her secret decoding algorithm to write her name; 
say it becomes Gybx. She then applies the bank’s public encoding algorithm to Gybx 
and sends the result (her “signature”) along with her message. The bank uses its secret 
decoding algorithm on this “signature” and obtains Gybx. It then applies Anne’s pub- 
lic encoding algorithm to Gybx, which turns it into Anne. The bank can then be sure 
the message is from Anne, because no one else could use her decoding algorithm to 
produce the word Gybx that is encoded as Anne. 

One public-key system was developed by R. Rivest, A. Shamir, and L. Adleman 
in 1977. Their system, now called the RSA system, is based on elementary number 
theory. Its security depends on the difficulty of factoring large integers. Here are the 
mathematical preliminaries needed to understand the RSA system. 


Lemma 13.1 
Let p, r, s, c eZ with p prime. ifp ¥ c and rc = sc (mod p), then r = s (mod p}. 


Proof since re = sc (mod p), p divides re — sc = (r —'s)c. By Theorem 1.5 
P\(r — s) or p|c. Since p ¥ c, we have p | (r — 5), and, hence, r = s (mod p). m 


mi: 

Lemma 13.2 Fermat’s Little Theorem 

If pis prime, 2€2, and p + a, then a°” = 1 (mod p). 

Proof* > None of the numbers a, 2a, 3a, . . . , (p — 1)ais congruent to 0 modulo 
p by Exercise 1. Consequently, each of them must be congruent to one 


of 1, 2, 3, .. ., p — 1 by Corollary 2.5 and Theorem 2.3. If two of them 
were congruent to the same one, say ra = i = sa (mod p) with 


1sir,ssp-1, 


*A proof based on group theoryis outlined in Exercise 38 of Section 7.3,and one based on field theory 
is in Exercise 13 of Section 11.6. 
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then we would have r = s (mod p) by Lemma 13.1 (with c = a). This is 
impossible because no two of the numbers 1, 2, 3,..., p — 1 are con- 
gruent modulo p (the difference of any two is less than p and, hence, 
not divisible by p). Therefore, in some order a, 2a, 3a,...,(p — l)a are 
congruent to 1, 2, 3,..., p — 1. By repeated use of Theorem 2.2, 


a: 2a: 3a-..(p— la=1-2+3..-(p—1) (modp). 


Rearranging the left side shows that 


a:aca-..a°1+2+3...(p—1) =1:2+3..-(p—1) (modp) 
1 +2-3..-(p— 1) =1(1°2°3---(p—1)) (mod p). 


Now p ¢(1-2-3..-(p — 1)) (if it did, p would divide one of the fac- 
tors by Corollary 1.6. Therefore, a’—! = 1 (mod p) by Lemma 13.1 (with 
c=1:2-3...(p—1). E 


Throughout the rest of this discussion p and q are distinct positive primes. Let 

= pq and k = (p — 1)(q — 1). Choose d such that (d, k) = 1. Then the equation 

dx = 1 has a solution in Z, by Theorem 2.9 (with n = k). Therefore, the congruence 
dx = 1 (mod &) has a solution in Z; call it e. 


Theorem 13.3 


Let p, g, N, k, e, d be as in the preceding paragraph. Then b°“ = b (mod n) for 
every bež. 


Proof > Since e is a solution of dx = 1 (mod k), de — 1 = kt for some t. Hence, 
ed = kt + 1, so that 


bed faa peel = pb! an be-1X:-1h m. (bP Y¥a-Dp, 
If p ¥ b, then by Lemma 13.2, 
B1 = (B-09 = (1)9-"™ b = b (mod p). 


If p | b, then b and every one of its powers are congruent to 0 modulo p. 
Therefore, in every case, b°? = b (mod p). A similar argument shows that 
b4 = b (mod q). By the definition of congruence, 


p|@“—-b) and = q|(6“- b). 


Therefore, pq | (b — b) by Exercise 2. Since pq = n, this means that n 
divides (b“ — b), and, hence, b”? = b (mod n). E 


The least residue modulo # of an integer c is the remainder r when c is divided 
by n. By the Division Algorithm, c = nq + r, so that c — r = nq, and, hence, c =r 
(mod 7). Since two numbers strictly between 0 and n cannot be congruent modulo 
n, the least residue of c is the only integer between 0 and n that is congruent to e 
modulo z. 
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We can now describe the mechanics of the RSA system, after which we shall show 
how it satisfies the conditions for a public-key system. The message to be sent is first 
converted to numerical form by replacing each letter or space by a two-digit number:* 


space = 00, A = 01, B = 02,..., Y = 25, Z = 26. 


For instance, the word GO is written as the number 0715 and WEST is written 
23051920, so that the message “GO WEST” becomes the number 07150023051920, 
which we shall denote by B. 

Let p, q, n, k, d, e, be as in Theorem 13.3, with p and g chosen so that B < pq = n. 
To encode message B, compute the least residue of B* modulo n; denote it by C. Then 
C is the coded form of B. Send C in any convenient way. 

The person who receives C decodes it by computing the least residue of C? modulo 
n. This produces the original message for the following reasons. Since B®, is congruent 
modulo z to its least residue C, Theorem 13.3 shows that 


C’ = (Bf = B“ = B (mod a). 


The least residue of C? is the only number between @ and n that is congruent to C4 
modulo n and 0 < B < n. So the original message B is the least residue of C’. 

Before presenting a numerical example, we show that the RSA system satisfies the 
conditions for a public-key system: 


1. When the RSA system is used in practice, p and g are large primes (several hun- 
dred digits each). Such primes can be quickly identified by a computer. Even 
though B, e, C, d are large numbers, there are fast algorithms for finding the 
least residues of B° and C? modulo n. They are based on binary representation 
of the exponent and do not require direct computation of B° or C4(which would 
be gigantic numbers). See Knuth [31] for details. So the encoding and decoding 
algorithms of the RSA system are computationally efficient. 

2. To use the RSA system, each person in the network uses a computer to choose 
appropriate p, q, d and then determines n, k, e. The numbers e and n for the 
encoding algorithm are publicly announced, but the prime factors p, q of n and 
the numbers d and k are kept secret. Anyone with a computer can encode mes- 
sages by using e and n. But there is no practical way for outsiders to determine 
d (and, hence, the decoding algorithm) without first finding p and g by factoring 
n.t With present technology this would take thousands of years! So the RSA 
system appears secure, as long as new and very fast methods of factoring are 
not developed. 


Even when n is chosen as above, there may be some messages that in numerical 
form are larger than n. In such cases the original message is broken into several blocks, 
each of which is less than n. Here is an example, due to Rivest-Shamir-Adleman. 


*More numbers could be used for punctuation marks, numerals, special symbols, etc. But this will be 
sufficient for illustrating the basic concepts. 


tAlternatively, one might try to find k and then solve the congruence ex = 1 (mod k) to get d. But this 
can be shown to be computationally equivalent to factoring n, so no time is saved. 
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EXAMPLE 1 


Let p = 47 and q = 59. Then n = pg = 47+ 59 = 2773 and k = (p — 1X(q —- 1) = 
46 - 58 = 2668.* Let d = 157. A graphing calculator or computer quickly veri- 
fies that (157, 2668) = 1 and that the solution of 157x = 1 (mod 2668) is e = 17. 
We shall encode the message “IT’S ALL GREEK TO ME.” We can encode only 
numbers less than z = 2773. So we write the message in two-letter blocks (and 
denote spaces by #): 


IT S# AL L# GR 
0920 1900 0112 1200 0718 


EE K# TO #M E# 
0505 1100 2015 0013 0500. 


Then each block is a number less than 2773. The first block, 0920, is encoded by 
usinge = 17 and a computer to calculate the least residue of 920! modulo 2773: 


920!" = 948 (mod 2773). 
The other blocks are encoded similarly, so the coded form of the message is 
0948 2342 1084 1444 2663 
2390 0778 0774 0219 1655. 


A person receiving this message would use d = 157 to decode each block. For 
instance, to decode 0948, the computer calculates 


94815? = 920 (mod 2773). 
This is the original first block 0920 = IT. 


For more information on cryptography and the RSA system, see Hoffstein, Pipher, 
and Silveman [33], Rivest-Shamir-Adleman [34], Simmons [35], and Trappe and 
Washington [36]. 


@ Exercises 


A. 1. Let p be a prime and k, a € Z such that p y a and 0 < k < p. Prove that ka #0 
(mod p). [Hint: Theorem 1.5.] 


2. If p and q are distinct primes such that p | c and q | c, prove that pg | c. [Hint: 
If ¢ = pk, then q | pk; use Theorem 1.5.] 


“These numbers will illustrate the concepts. But they are too small to provide a secure code since 
2773 can be factored by hand. 

tTo solve the congruence on a calculator, use the Technology Tip on page 12 to find u and y such that 
157u + 2668v = 1. Then 157u — 1 = 2668y, which means that 157u = 1 (mod 2668). 
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3. Use a calculator and the RSA encoding algorithm with e = 3, n = 2773 to 
encode these messages: 


(a) GO HOME (b) COMEBACK (e) DROP DEAD 
[Hint: Use 2-letter blocks and don’t omit spaces,] 
4. Prove this version of Fermat’s Little Theorem: If p isa prime and a EZ, then 


@ = a (mod p). [Hint: Consider two cases, p |a and p ¥ a; use Lemma 13.2 in 
the second case.] 


B. 5. Find the decoding algorithm for the code in Exercise 3. 


6. Let C be the coded form of a message that was encoded by using the RSA 
algorithm. Suppose that you discover that C and the encoding modulus n 
are not relatively prime. Explain how you could factor n and thus find the 
decoding algorithm. [The probability of such a C occurring is less than 10-” 
when the prime factors p, q, of n have more than 100 digits.] 
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CHAPTER Iå 


The Chinese Remainder Theorem 


Prerequisites: Section 2.1 and Appendix C for Section 14.1; Section 3.1 
for Section 14.2; Section 6.2 for Section 14.3. 


TheChinese Remainder Theorem (Section 14.1) is a famous result in number theory 
that was known to Chinese mathematicians in the first century. It also has practical 
applications in computer arithmetic (Section 14.2). An extension of the theorem 
to rings other than Z has interesting consequences in ring theory (Section 14.3). 
Although obviously motivated by Section 14.1, Section 14.3 is independent of the 
rest of the chapter and may be read at.any time after you have read Section 6.2. 


14 Proof of the Chinese Remainder Theorem 


A congruence is an equation with integer coefficients in which “=” is replaced by 
“= (mod n).” The same equation can lead to different congruences, such as 


6x + 5 = 7 (mod 3) or 6x + 5 = 7 (mod 5). 


Only integers make sense as solutions of congruences, so the techniques of solving 
equations are not always applicable to congruences. For instance, the equation 6x + 5 = 7 
has x = 1/3 as a solution, but the congruence 6x + 5 = 7 (mod 3) has no solutions 
(Exercise 3), and 6x + 5 = 7 (mod 5) has infinitely many solutions (Exercise 4). 

A number of theoretical problems and practical applications require the solving of 
a system of linear congruences, such as 


x = 2 (mod 4) 
x = 5 (mod 7) 
x = 0 (mod 11) 
x = 8 (mod 15) 
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A solution of the system is an integer that is a solution of every congruence in the sys- 
tem. We shall examine some cases in which a system of linear congruences must have 
a solution. 


Lemma 14.1 

lf mand n are relatively prime positive integers and a, beZ, then the system 
x = a (mod m) 
x = b {mod n) 

has a solution. 

Proof» since (m, n) = 1, there exist integers u and v such that mu + nv = 1 by 


Theorem 1.2. This equation and the definition of congruence lead to 
four conclusions: 


(i) mu = 0 (mod m) (ii) nv = 1 (modm) [Because | — nv = mu] 
(iii) wv = 0 (mod n) (iv) mu =1 (mod n) [Because 1 — mu = nv] 
Let t = bmu + anv. Then by (i), (ii), and Theorem 2.2, 
t = bmu + aw =b -0 + a-l =a (mod m), 
so that t = a (mod m). Similarly, by (iii), (iv), and Theorem 2.2, 
t = bmu + anv =b :1 +a'0 =b (modn), 
so that t = b (mod n). Therefore, ¢ is a solution of the system. E 
The proof of Lemma 14.1, provides the 
Solution Algorithm for the System in Lemma 14.1 
1. Find u and v such that mu + nv = 1.* 


2. Then ¢ = bmu + anv is a solution of the system 
EXAMPLE 1 


To solve the system 
x = 2 (mod 4) 
x = 5 (mod 7), 
apply the algorithm with m = 4,n =7,a =2,b = 5: 
1. Itis easy to see that u = 2, v = —1 satisfy 4u + w = 1. 
2. Therefore, a solution of the system is 


t = bmu + aw = 54:2 +2.7:(—1)= 26. 


*This can be done by hand by using the Euclidean Algorithm; see Exercise 15 in Section 1.2. It can 
also be done on a computer or graphing calculator; see the Technology Tip on page 12. 
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Theorem 14.2 The Chinese Remainder Theorem* 


Let My Mo,...,M, be pairwise relatively prime positive integers (meaning 
that (m, m) = 1 whenever į # j). Let a;, a, ..., & be any integers. 


(1) The system 


x = a, (mod m,) 
x = a (mod my) 
x = a3 (mod mM) 


x = & (mod m) 
has a solution. 
(2) lf £ is one solution of the system, then an integer z is also a solution 
if and only if 2 = £ (mod m, Ma M3 +++ m,). 


For reasons that will become apparent below, we shall use induction to prove the 
first part of the theorem. For a proof that does not use induction, see Exercise 21. 


Proof of Theorem 14.2 » (1) The proof is by induction on the number r of congru- 
emes in the system. If r = 2, then there is a solution by Lemma 14.1 
(with m = m,, n = Mm, a = a;, b = a,). So-suppose inductively that there 
is a solution when r = k and consider the system 
x = a, (mod m) 
x = a (mod m) 
x = a, (mod m,) 


(*) 


x = a, (mod m,) 
xX = ap}, (mod my,1) 


By the induction hypothesis, the system consisting of the first k congru- 

ences in (+) Aas a solution s. Furthermore, mmm; > * + m, and m44; are 

relatively prime (Exercise 5). Consequently, by Lemma 14.1, the system 
x=s (mod mmm: mą) 

(++) X = Ag+ (mod m,,1) 


*So named because it was known to Chinese mathematicians in the first century. 
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has a solution t. The number f necessarily satisfies 
t = s (mod mmm; *** mọ). 
Consequently, for each i= 1, 2, 3, . . . , k, 
t = s (mod m). 


(Reason: If t — s is divisible by mmm; * - - My then it is divisible by each 
m). Now s is a solution of the first k congruences in (++), so for each i= k 


t = s (mod m;) and s=a;(modm). 
By transitivity (Theorem 2.1), 
t = a; (mod m) fori=1,2,...,k. 


Since ¢ is a solution of (++), it must also satisfy £ = a,,, (mod My41). 
Hence, t is a solution of the system (+), so that there is a solution 
when r = k + 1. Therefore, by induction, every such system has a 
solution. 

(2) If z is any other solution of the system, then for each i = 1,2, ..., r, 


z =a,(mod m) and t = a (mod m). 
By transitivity (Theorem 2.1), z = t (mod m). Thus 
m | (z — £), m | (2 — i), m | (z — ô, . .. , m, | (Z — Ò. 
Therefore, mmm; - - - m, | (z — 0) by Exercise 7. Hence, 
z = t (mod mmm; -« - - m). 


Conversely, if z = t (mod mmyn; - - - m,), then, as above, z = t (mod m) 
foreach i = 1, 2,... , r. Since t= a (mod m,), transitivity shows that z = a, 
(mod 7n,) for each i. Therefore, z is a solution of the system. E 


The proof of Theorem 14.2 actually provides an effective computational algorithm 
for solving large systems: Solve the first two by Lemma 14.1, then repeat the inductive 
step as often as needed to determine a solution of the entire system. 


EXAMPLE 2 
We shall solve the system 
x = 2 (mod 4) 
x = 5 (mod 7) 
x = 0 (mod 11) 
x = 8 (mod 15). 
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Example | shows that x = 26 is a solution of the system consisting of the first two 
congruences: 
x = 2 (mod 4) 
x = 5 (mod 7). 
Next we solve the system 
x = 26 (mod 4: 7) 
x = 0 (mod 11). 
First, note that u = 2 and v = —5 satisfy 28u + llv = 1.* Then the Solution 


Algorithm preceding Example 1 (with a = 26, m = 4 - 7 = 28, b = 0, n = 11) shows 
that a solution is 


bmu + anv = 0+ 28 - 2 + 26 - 11 + (—5) = —1430. 


You can readily verify that x = —1430 is also a solution of the system consisting of the 


first three congruences: 
x = 2 (mod 4) 
x = 5 (mod 7) 
x = 0 (mod 11). 


Finally, we solve this system: 


x = —1430 (mod 4 : 7 = 11) 
x=8 (mod 15). 


Note that u = 2 and v = —41 satisfy 308u + 15v =1.* So by the Solution Algorithm 
(with a = —1430, m = 4:7+ 11 = 308, b = 8, n = 15), a solution is 


bmu + anv = 8 + 308 + 2 + (-1430) + 15+ (—41) = 884,378. 


You can venfy that x = 884,378 is a solution of the entire system 


x = 2 (mod 4) 
x = 5 (mod 7) 
x = 0 (mod 11) 
x = 8 (mod 15). 


Since 4: 7 « 11 + 15 = 4620 and 884,378 = 1958 (mod 4620), as you can easily 
verify, x = 1958 is also a solution of the system by Theorem 14.2. When work- 
ing by hand, the smaller solution is easier to use. So we say that the solutions 
of the system are all numbers that are congruent to 1958 modulo 4620. 


“The values for u and y were found with a graphing calculator program; see the Technology Tip on 
page 12. 
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Technology Tip: Systems such as the one in Example 2 can be solved by the 
Chinese Remainder Theorem program for TI graphing calculators that can be 
downloaded from our website (ADDRESS TBA). In Example 2, when asked, 
you enter the list of constants {2, 5, 0, 8} and the corresponding list of moduli 
{4, 7, 11, 15}. The program then produces the solution, as shown in Figure 1. 


SOLUTI on ess 


FIGURE 1 


To solve the same system with Maple, use the command 
chrem ({2, 5, 0, 8], [4, 7, 11, 15]; . 


E Exercises 


A. 1. If u =v(modz) and wis a solution of 6x + 5 = 7 (mod n), then show that v is 
also a solution. [Hint: Theorem 2.2.] 


2. If 6x + 5 =7 (mod n) has a solution, show that one of the numbers 1, 2, 3,..., 
n — 1 is also a solution. [Hint: Exercise 1 and Corollary 2.5.] 


3. Show that 6x + 5 = 7 (mod 3) has no solutions. [Hint: Exercise 2.] 


4. Show that 6x + 5 =7 (mod 5) has infinitely many solutions. 
[Hint: Exercises 1 and 2.] 

5. If m, Mh, ..., Mp Mg; are pairwise relatively prime positive integers (that is, 
(m, m) = 1 when i + f), prove that myn, > > + m, and m,,, are relatively prime. 
[Hint: If they aren't, then some prime p divides both of them (Why). Use 
Corollary 1.6 to reach a contradiction.] 

6. If (m, n) = 1 and m | d and n | d, prove that mn | d. [Hint- If d = mk, then 
n | mk; use Theorem 1.4.] 

7. Let m,, Mz, . . . , m, be pairwise relatively prime positive integers (that is, 
(mm) = 1 when i + f). Assume that m, | d for each i. Prove that 
mmm; - - - m, |d. [Hint: Use Exercises 5 and 6 repeatedly.] 


In Exercises 8-13, solve the system of congruences. 


8. x = 5 (mod 6) 9. x = 3 (mod 11) 
x = 7(mod 11) x = 4(mod 17) 
10. x = 1 (mod 2) 11. x = 2(mod 5) 
x = 2 (mod 3) x = 0 (mod 6) 
x = 3 (mod 5) x = 3 (mod 7) 
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12. x = 1 (mod 5) 13. x = 1 (mod 7) 
x = 3 (mod 6) x = 6 (mod 11) 
x = 5(mod 11) x = 0 (mod 12) 
x = 10 (mod 13) x = 9 (mod 13) 
x = 0 (mod 17) 
B. 14. (Ancient Chinese Problem) A gang of 17 bandits stole a chest of gold coins. 


18. 
19. 


20. 


21. 


When they tried to divide the coins equally among themselves, there were 
three left over. This caused a fight in which one bandit was killed. When the 
remaining bandits tried to divide the coins again, there were ten left over. 
Another fight started, and five of the bandits were killed. When the survivors 
divided the coins, there were four left over. Another fight ensued in which 
four bandits were killed. The survivors then divided the coins equally among 
themselves, with none left over. What is the smallest possible number of coins 
in the chest? 


. If (a, n) = d and d | b, show that ax = b (mod n) has a solution. [Hint: b = de 


for some c, and au + nv = d for some u, v (Why?). Multiply the last equation 
by c; what is auc congruent to modulo n?) 


. If (a, n) = d and d ¥ b, show that ax = b (mod n) has no solutions. 


. If (a, n) = 1 ands, t are solutions of ax = b (mod n), prove that s = ¢ (mod n). 


[Hint: Show that n | (as — at) and use Theorem 1.4.] 
If (a, n) = d and s, t are solutions of ax = b (mod n), prove that s = ¢ (mod n/d). 
If (m, n) = d, prove that the system 

x =a (mod m) 

x = b (mod n) 


has a solution if and only if a = b (mod d). 


If s, tare solutions of the system in Exercise 19, prove that s = t (mod r), 
where r is the least common multiple of m and n. 


(Alternate Proof of part (1) of the Chinese Remainder Theorem) For each 
i= 1,2,...,97, let N; be the product of all the m, except m, that is, 


N, = mm: ++ MyM ++ + My 


(a) For each i, show that (N, m) = 1, and that there are integers u; and v, such 
that Na; + my; = 1. 


(b) For each é and j such that i +J, show that Nj, = 0 (mod m). 
(c) For each i, show that Naz, = 1 (mod mi). 


(d) Show that t = a,Nyu, + a,Nzu, + aNu + `> * + @,N,u, is a solution of 
the system. 
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14.2 Applications of the Chinese Remainder Theorem 


Every computer has a limit on the size of integers that can be used in machine arith- 
metic, called the word size. In a large computer this might be 2°°. Computer arithmetic 
with integers larger than the word size requires time-consuming multiprecision tech- 
niques. In such cases an alternate method of addition and multiplication, based on the 
Chinese Remainder Theorem, is often faster. 

For any numbers r, $, t, n less than the word size, a large computer can quickly 
calculate 


r + sandr - s (even when the answer is larger than the word size); 


the least residue of t modulo n* (including the case when f exceeds the word size— 
see Exercise 2); 


sums and products in Z,,. 


Finally, a computer can use a slight variation of the Chinese Remainder Theorem 
solution algorithm (Theorem 14.2) to solve systems of congruences. But this may 
involve numbers larger than the word size and, hence, require slower multiprecision 
techniques. 

To get an idea of how the alternate method works, imagine that the word size of 
our computer is 100, so that multiprecision techniques must be used for larger num- 
bers. The following example shows how to multiply two four-digit numbers on such a 
computer, with minimal use of multiprecision techniques. 


EXAMPLE 1 


We shall multiply 3456 by 7982 by considering various systems of congruences 
and using the Chinese Remainder Theorem. We begin by choosing several 
numbers as moduli and finding the least residues of 3456 and 7982 for each 
modulus:* 


3456 = 74 (mod 89) 7982 = 61 (mod 89) 
3456 = 36 (mod 95) 7982 =2 (mod 95) 
(*)  3456=61(mod97) 7982 = 28 (mod 97) 
3456 = 26 (mod 98) 7982 = 44 (mod 98) 
3456 = 90 (mod 99) 7982 = 62 (mod 99). 


Then by Theorem 2.2 we lanow that 3456 - 7982 = 74 - 61 (mod 89). Taking the 
least residue of 74 - 61 modulo 89 and proceeding in similar fashion for the other 
congruences, we have 


*The least-residue modulo n of a number £ is the remainder r when fis divided by n. By the Division 
Algorithm, £ = ng + r so that t — r = nq and t = r (mod n). 
tThe reason why 89, 95, 97, 98, and 99 were chosen as moduli will be explained below. 
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3456 + 7982 = 74 - 61 = 64 (mod 89) 
3456 - 7982 = 36-2 =72(mod 95) 
(+s) 3456 - 7982 = 61 - 28 = 59 (mod 97) 
3456-7982 = 26 + 44 = 66 (mod 98) 
3456 + 7982 = 90 - 62 = 36 (mod 99). 


Therefore, 3456 - 7982 is a solution of this system: 


x = 64 (mod 89) 
x = 72 (mod 95) 
(###) x = 59 (mod 97) 
x = 66 (mod 98) 


x = 36 (mod 99). 


The Chinese Remainder Theorem* shows that one solution of (+*+) is 27,585,792 
and that every solution (including 3456 - 7982) is congruent to this one modulo 
89 +95 + 97 + 98 - 99 = 7,956,949,770 (which we denote hereafter by M). Since no two 
numbers between 0 and M can be congruent modulo M, 27,585,792 is the only solu- 
tion between 0 and M. We lanow that 0 < 3456 - 7982 < 10* - 10* = 10° < M. Since 
3456 - 7982 is a solution, we must have 3456 - 7982 = 27,585,792. 


Now look at this example from a different perspective. If you think of the least 
residue of a number modulo n as an element of Z,, then the congruences in (+) say 
that the integer 3456 may be represented by the element (74, 36, 61, 26, 90) in the ring 
Zso X Los X Zo7 X Log X Lo. Similarly, 7982 is represented by (61, 2, 28, 44, 62). Saying 
that 74 - 61 = 64 (mod 89) in (xx) is the same as saying 74 - 61 = 64 in Z. So the 
congruences in (+*+) are equivalent to multiplication in Zgg X Zos X Zo; X Log X Zog: 


(74, 36, 61, 26, 90) « (61, 2, 28, 44, 62) = (74 - 61, 36 - 2,61 - 28, 26 > 44, 90 - 62) 
= (64, 72, 59, 66, 36). 


The solution of (+++) shows that the element (64, 72, 59, 66, 36) of the ring 
Zgo X Zos X Loz X Log X Zo represents the integer 27,585,792. 

The procedure in the case of a realistic word size is now clear. Let m, . . . , m, be 
pairwise relatively prime positive integers: 


1. Represent each integer t as an element of Zm, X +++ X Zm, by taking the congru- 
ence class of t modulo each m, 

2. Do the arithmetic in Zm, X +++ X Zm, 

3. Use the Chinese Remainder Theorem to convert the answer into integer form. 


The m, must be chosen so that their product M is larger than any number that will 
result from the computations. Otherwise, the conversion process in Step 3 may fail 
(Exercises 3-5). This is sometimes done, as in the example, by taking the m, to be as 


*Up to this point, all computations have been quickly performed by our imaginary computer. This is 
the first place where slower multiprecision calculations may be needed because of numbers that 
exceed the word size. 
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large as possible without exceeding the word size of the computer. If smaller moduli 
are chosen, more of them may be necessary to ensure that M is large enough. 

The conversion process from integer to modular representation and back (Steps 1 
and 3) requires time that is not needed in conventional integer multiplication (espe- 
cially Step 3, which may involve multiprecision techniques). But this need be done only 
once for each number, at input and output. The modular representation may be used 
for all intermediate calculations. It is much faster than direct computation with large 
integers, especially in a computer with parallel processing capability, which can work 
simultaneously in each Z,,. Under appropriate conditions the speed advantage in Step 2 
outweighs the disadvantage of the extra time required for Steps 1 and 3. For more 
details, see Knuth [31]. 


It is sometimes necessary to find an exact solution (not a decimal approximation) 
of a system of linear equations. When there are hundreds of equations or unknowns 
in the system and the coefficients are large integers, the usual computer methods will 
produce only approximate solutions because they round off very large numbers dur- 
ing the intermediate calculations. The Chinese Remainder Theorem is the basis of a 
method of finding exact solutions of such systems. 

Very roughly, the idea is this. Let m, ..., m, be distinct primes (and, hence, 
pairwise relatively prime).* For each m,, translate the given system of equations into 
a system over Z,, by replacing the integer coefficients by their congruence classes 
modulo m, Then solve each of these new systems by the usual methods (Gauss- 
Jordan elimination works equally well over the field Z,, as over R, and round-off is 
not a problem with the smaller numbers in Zm). Finally, use the Chinese Remainder 
Theorem and matrix algebra to convert these solutions modulo m, into a solution of 
the original system.t 


E Exercises 


A. 1. Assume that your computer has word size 100. Use the method outlined in 
the text to find the sum 123,684 + 413,456, using m, = 95, m, = 97, m, = 98, 
m4 = 99. 


2. (a) Find the least residue of 64,397 modulo 12, using only arithmetic in Z42- 
[Hint: Use Theorems 2.2 and 2.3 and the fact that 64,397 = 
(((6 + 10 + 4)10 + 3)10 + 9)10 + 7.] 


(b) Let n be a positive integer less than the word size of your computer and 
t any integer (possibly larger than the word size). Explain how you might 
find the least residue of ¢ modulo, using only arithmetic in Z, (and thus 
avoiding the need for multiprecision methods). 


“Considerations of size similar to those discussed above play a role in the selection of the m; 
‘This conversion is a bit trickier than may first appear. For instance, the system 
8x + Sy = 12 = 
agi becomes ARPES 


4x + Sy =10 4x +5y=3 sia 


You can verify that x = 4, y = 3is a solution of the Z, system. | t is not immediately clear how to get 
from this to the solution of the original system, which is x = 1/2, y = 8/5. 
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3. Use the method outlined in the text to represent 7 and 8 as elements of Z, X Zs. 
Show that the product of these representatives in Z, X Z; is (2, 1). If you use the 
Chinese Remainder Theorem as in the text to convert (2, 1) to integer form, do 
you get 56? Why not? This example shows why the method won’t work when the 
product of the m,is less than the answer to the arithmetic problem in question. 
Also see Exercise 5. 


B. 4. Let f:Z— Z, X Z, X Z; be given by f(t) = ([th, [4h [¢]s), where [f], is the 
congruence class of ¢ in Z,,. The function f may be thought of as representing t 
as an element of Z, X Z, X Z, by taking its least residues. 

(a) If 0 =r, s < 60, prove that /(r) = f(s) if and only if r = s. 
[Hint: Theorem 14.2.] 


(b) Give an example to show that if r or s is greater than 60, then part (a) may 
be false. 


. Let mı, M . . . , m, be pairwise relatively prime positive integers and 
SZ — Za, X Zm, X -+ X Zm, the function given by 


F(t) = tln iln- ++ Tla), 
where [t}„ is the congruence class of t in Z,,. Let M = mmz- + + m,. If 


0 =r,s < M, prove that f(r) = f(s) if and only if r = s. [Exercise 4 is a special 
case.] 


tn 


6. Assume Exercise 7(c). If your computer has word size 235, what m, might you 
choose in order to do arithmetic with integers as large as 2'* (approximately 
2.45 Xx. 1055)? 


C. 7. (a) If aand b are positive integers, prove that the least residue of 2* — 1 
modulo 2 — 1 is 2’ — 1, wherer is the least residue of a modulo b. 


(b) If a and b are positive integers, prove that the greatest common divisor of 
2* — 1 and 2 — 1 is 2' — 1, where t is the gcd of a and b. [Hint: Use the 
Euclidean Algorithm and part (a).] 


(c) Let a and b be positive integers, Prove that 2° — 1 and 2” — 1 are relatively 
prime if and only if a and b are relatively prime. 


14.3 | The Chinese Remainder Theorem for Rings 


The Chinese Remainder Theorem for two congruences can be extended from Z to 
other rings by expressing it in terms of ideals. The key to doing this is the definition of 
congruence modulo an ideal (Section 6.1) and the following fact: When A and B are 
idealsin aring R, theset of sums {a + b|ae A, b € B}is denoted 4 + Band is itself 
an ideal (Exercise 20 of Section 6.1). 

Let m and n be integers. Let I be the ideal of all multiples of m in Z and J the ideal 
of all multiples of n. Then congruence modulo m is the same as congruence modulo the 
ideal I. If (m, n) = 1, then mu + nv = 1 for some u, v € Z. Multiplying this equation by 
any integer r shows that m(ur) + n(ur) = r. Thus every integer is the sum of a multiple 
of m and a multiple of n, that is, the sum of an element of the ideal J and an element 
of the ideal J. Therefore, J + J is the entire ring Z. So the condition (m,n) = | amounts 
to saying I+ J = Z. 
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When (7, n) = 1, the intersection of the ideals Jand J is the ideal consisting of all 
multiples of mn (Exercise 6 of Section 14.1). So two integers are congruent modulo mn 
precisely when they are congruent modulo the ideal IN J. 

The italicized statements in the preceding paragraphs tell us how to translate the 
Chinese Remainder Theorem for two congruences into the language of ideals. By 
replacing the ideals in that discussion by ideals in any ring R, we obtain 


Theorem 14.3 Chinese Remainder Theorem for Rings 


Let / and J be ideals in a ring R such that / + J = R. Then for any a, b ER, the 
system 


x=a(mod /) 
x = b (mod J) 


has a solution. Any two solutions of the system are congruent modulo / N J. 


When R has an identity, the theorem can be extended to the case of r ideals A, 4,..., 
I, and congruences x = a, (mod J,), under the hypotheses that J, + J, = R whenever i + j 
(see Exercise 6 and Hungerford [5; p. 131). 


Proof of Theorem 14.3> Since 7+ J = Rand b — ae R, there exist i € JEJ 
such that į + j = b — a. Hence, a + i = b — j. Let t = a + i; then 


t-a=(a+i-a=iel, 
so that t = a (mod 2). Similarly, since a + į = b — f 
t-b=(a+ġÌ-b=(b-j)-b=-jEJ. 


Hence, t = b (mod J), and f is a solution of the system. If z is also a 
solution, then 


z =a (mod J) and t = a (mod J) imply that z = ł (mod J) 


by Theorem 6.4. Similarly, z = ¢ (mod J). This means that z — te Tand 
z — tE J. Therefore, z — te I N Jand z = t (mod IN J). E 


One consequence of the Chinese Remainder Theorem is a useful isomorphism of 
rings. 


Theorem 14.4 


If / and J are ideals in a ring Rand / + d = R, then there is an isomorphism 
of rings 


RKE N J) = R/T X RIJ. 
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Proof > Define a map f:R—> R/I X R/J by f(r) = (r + I, r + J). Then fisa 
homomorphism because 
LO +f/9NHC+Lr+I+G6+hs+J) 
=(r+H)+ hr +9)+ HD =f(rt+s) 


and 


JMNM = (r+ hr+No+hst+N 
= (rs + I, rs + J) = firs). 


To show that fis surjective, let (a + I, b + J)ER/I X R/J. We must find 
an element of R whose image under fis (a + I, b + J). By Theorem 14.3 
there is a solution ¢ E R for this system: 

x =a (mod J) 

x = b (mod J). 


But ¢t = a (mod J) implies that t + I = a + I by Theorem 6.6. Similarly, 
t = b (mod J) implies t + J = b + J, so that 


MO=0+Lt+)=(a+h5b+/). 


Therefore, f is surjective. 

Let K be the kernel of f. By the First Isomorphism Theorem 6.13, R/K 
is isomorphic to R/I X R/J. Now K consists of all elements r € R such 
that f(r) is the zero element in R/I X R/J, that is, all r such that 


(r+I,r+ J) = (Og + I Og + J), 
or equivalently, 
r+I=0}+I and r+J=0g+ J. 


But r + I = Og + I means that r = Opg (mod 7), and, hence, re F. 
Similarly, r + J = 0, + J implies r E J. Therefore, re I N J. So IN Jis 
the kernel of f, and R/(I N J) = R/Ker f= R/I X R/J. M 


Corollary 14.5 


If (m, n) = 1, then there is an isomorphism of rings Zmn = Zm X Zp: 


Proof > tn the ring Z, the ideal (m) consists of all multiples of m and the ideal 
(n) of all multiples of n. The first three paragraphs of this section show 
that (m) + (n) = Z and that (m) N (n) is the ideal (mn) of all multiples 
of mn. Furthermore, the quotient rings Z/ (mn), Z/(m), and Z /(”) are, 
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respectively, Z mn Zm and Z,. Therefore, by Theorem 14.4 (with R = Z, 
I = (m), J = (n)) there is an isomorphism 


Zon = Z/(mn) = Z/((m) N (1) = Z/(m) X Z/(n) = Zm X Z, 


Corollary 14.6 


If n = pi™p: p3” - p", where the p, are distinct positive primes and each 
n; > 0, then there is an isomorphism of rings 


Z, =Z n X Zam X Pam X +++ X Lym 


Proof > since the p; are distinct primes, p,” and the product ph} i- - -p," are rela- 
tively prime for each i. So repeated use of Corollary 14.5 shows that 


Z, = Zin X Eorp" vo pa = Zo X Zon X Zon a HE ee 
= Ay» X Lyn X Lyn X -t X Lys. 


E Exercises 


A. 1. (a) Show that Z; X Z2 is isomorphic to Z} X Zy. 
(b) Is Z, X Zs isomorphic to Z; X Zz? 
2. If I and J are ideals in a ring R and aET, b EJ, show thatabeIn J. 

B. 3. If (m, n) # 1, show that Zm, is not isomorphic to Z,, X Z,. [Hint: If (m, n) = d, 
then m is an integer (Why?). If there were an isomorphism, then 1 € Zna 
would be mapped to (1, R EZ m X Z,, Reach a contradiction by showing that 

me 1# 0 in Zom but “> - (1,1) = (0,0) in Zm x Z,.] 


4. Which of the following rings are isomorphic: Z, X Z, X Z3, Z3 X Z4 X Z» 
Zsa, Z3 X Zi, Z2 X Z3 X Zia Z4 X Ly? 

5. If Z, h, 4 are ideals in a ring R with identity such that J, + 4 = R and & + h = 
R, prove that (J, N F) + 4 = R. [Hint: If re R, thenr = i + hand lg = h + t; 
for some i, E h, 2 Eh, and h, h E B. Then r = (4 + A)(tz + t3); multiply this out 
to show that r is in 7, N 4) + B. Exercise 2 may be helpful.] 

6. Let J, h, J; be ideals in a ring R with identity such that J, + Z = R whenever 
i + j. If a€ R, prove that the system 

x = a, (mod fj) 
x = a, (mod J) 
x = a (mod J;) 
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has a solution and that any two solutions are congruent modulo 4 N R A h. 
[Hint: If s is a solution of the first two congruences, use Exercise 5 and 
Theorem 14.3 to show that the system 


s (mod J, N A) 
a, (mod h) 


has a solution, and it is a solution of the original system.] 


x 
Mill 
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CHAPTER 1) 


Geometric Constructions 


Prerequisites: Sections 4.1, 4.4, and 4.5. 


Since the sixth century B.c., mathematicians have studied geometric construc- 
tions with straightedge (unmarked ruler) and compass. Despite their prowess in 
geometry, the ancient Greeks were never able to perform certain constructions 
using only straightedge and compass, such as 


Duplication of the Cube: Construct the edge of a cube having twice the 
volume of a given cube.* 


Trisection of the Angle: Construct an angleonethird the size of a given angle. 


Squaring the Circle: Construct a square whose area is equal to the area of 
a given circle. 


Finally in the last century it was proved that each of these constructions is impos- 
sible. This chapter presents an elementary proof of the impossibility of the first 
two constructions listed above (the third is discussed in Exercise 21). 


Many people remain fascinated by these problems, particularly angle trisection, 
and continue to publish what they say are “solutions,” even though it has been proved 
that there are none (see, for example Dudley [37]). Consequently, it is important to 
understand just what we claim is impossible here and what constitutes a proof. 

The ancient Greeks knew that all the constructions listed above could readily be car- 
ried out provided that additional tools were permitted. For instanoe, any angle can be 
trisected using a compass and straightedge with just one mark on it. The Greeks also 


*This problem supposedly had its origin in an ancient legend: Athens was afflicted by a plague and 
its people were told by the oracle at Delos that the plague would end when they built a new altar 
to Apollo in the shape of a cube that had twice the volume of the old altar, which was also a cube. 


459 
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knew that some angles, such as 9®°, can be trisected by straightedge and compass alone 
(Exercise 3). So the issue is not whether these constructions can ever be performed, but 
whether they can be performed in every possible case using only an (unmarked) straight- 
edge and a compass. Furthermore, physical measurement alone is not sufficient to jus- 
tify such constructions because no measuring device is absolutely accurate. Justification 
requires a valid mathematical proof based on accepted principles and the rules of logic. 
The key to the impossibility proofs presented here (and to every other known proof 
of these facts) is to translate the geometric problem into an equivalent algebraic one. 
Under this translation process, as we shall see, constructions with a straightedge cor- 
respond to solving linear equations and constructions with a compass to solving qua- 
dratic equations. Before we can begin this translation process, we present a typical 
straightedge-and-compass construction to give you a feel for what we are dealing with. 


EXAMPLE 1 


Given points O and P, construct a line perpendicular to line OP through O as 
follows. Construct the circle with center O and radius OP; it intersects line OP 
at points R and P, as shown on the left side of Figure 1. Segments OR and OP 
are radii of the circle and thus have the same length. Now construct the circle 
with center R and radius RP and the circle with center P and radius RP. These 
circles intersect in points A and B as shown in the center of Figure 1. Segments 
RP, RA, and PA have the same length. (Why?) 


FIGURE1 


Draw the line AO. In triangle RAP, shown on the right of Figure 1, the sides 
RA and PA are congruent, as are the sides OR and OP. Side OA is congruent to 
itself. Therefore, triangles ORA and OPA are congruent by side-side-side. Since 
angles ROA and POA are congruent and supplementary, each of them must 
be a right angle. Therefore, line AO is perpendicular to line OP at O. 


Outline of the Argument 


Now we begin the translation from geometry to algebra. The following outline should 
help you to see where we’re headed and to keep things straight as we go along. The 
capitalized headings here correspond to the headings on the subsections below. 


CONSTRUCTIBLE POINTS We begin with any two points and determine 
what additional points can be constructed from them by straightedge-and-compass 
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constructions; these are the constructible points. Next we use the distance between the 
original two points as the unit length and coordinatize the plane. 


CONSTRUCTIBLENUMBERS A number rissaid to be constructible if the 
point (r, 0) is a constructible point. We then examine the equations of lines and circles 
determined by constructible points and the coordinates of their intersection points. 
This leads to a characterization of constructible numbers in terms of certain subfields 
of R and square roots of positive elements of R. 


ROOTS OF POLYNOMIALS = Thecharacterization of constructible numbers 
is then used to show that certain cubic polynomials have no constructible numbers as 
roots. 


IMPOSSIBILITY PROOFS Finally, we demonstrate the impossibility of the 
constructions in question by using proof by contradiction: If the construction were 
possible, then one of the cubic polynomials mentioned in the preceding paragraph 
would have a constructible number as a root, which is a contradiction. 


Constructible Points 


We first give a formal mathematical description of straightedge-and-compass con- 
structions, such as those in Example 1, that begin with two points O and P. Let S be 
the set {O, P}. Form the line determined by the two points of S. Form the two circles 
with centers O and P and radius OP. Let S, be the set of all points of intersection of 
this line and these circles, together with the points O, P in the original set S. Repeat 
this process with S}. Form every line determined by pairs of points in §,. Form every 
circle whose radius is the distance between some pair of points in S, and whose center 
is a point in S+. Let S, be the set of all points of intersection of these lines and circles, 
together with the points in S}. Repeat the process with §,. Continuing in this way pro- 
duces a sequence of sets 


S18, 2G, SSS w 


A constructible point is any point that lies in some S;. A constructible line is a line that 
contains at least two constructible points. A constructible circle is one whose center is 
a constructible point and whose radius has length equal to the distance between some 
pair of constructible points. For example, all the labeled points and all the lines and 
circles in Figure | are constructible. Note that points of intersection of constructible 
lines and circles are constructible points. 

Now we coordinatize the plane by taking O as the origin, the distance from O to 
P as the unit length, and the line OP as the x-axis, and P having coordinates (1, 0). 
Figure 1 shows that the y-axis (the line AO) is a constructible line. The point (0, 1) is 
constructible since it is the intersection of the y-axis and the constructible circle with 
center O and radius OP. A similar argument shows that 


(r, 0) is constructible if and only if (0, r) is constructible. 


Constructible Numbers 


A real number r is said to be a constructible number if the point (r, 0) is a constructible 
point. Every integer is a constructible number (Exercise 4). If r is the distance between 
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two constructible points A and B, then r is a constructible number because (r, 0) is the 
intersection of the constructible x-axis and the constructible circle with center O and 
radius r. Exercise 18 shows that 


a point is constructible if and only if its coordinates are 
constructible numbers. 


Theorem 15.1 


Let a, b, c, d be constructible numbers with e # 0 and d > 0. Then each of 
a+b,a—b,ab,a/c,and V4 is a constructible number. 


Proof» We first assume a and c are positive and show that a/c is a constructible 
number. Since a and c are constructible numbers, the points (a, 0) and 
(0, ¢) are constructible and so is the line L they determine. The line 
through the constructible point (0, 1) parallel to L is constructible 
(Exercise 19). It intersects the x-axis at the constructible point (x, 0), as 
shown on the left side of Figure 2. Hence, x is a constructible number. 


a ; 1 . 
Use similar triangles to show that =. =, which implies that x = a/c. 


When a = 0 or when a or c is negative, Exercise 13 shows that a/c is a 
constructible. 


FIGURE 2 


If b = 0, then ab = 0 is certainly constructible. If b # 0, then 1/b is 
constructible by the previous paragraph, and hence a/(1/b) = ab is also 
constructible. Exercise 2 shows that a + b and a — b are constructible. 

The number d + 1 is constructible by Exercise 2. So the midpoint A 
of the line segment joining the constructible points (0, 0) and (d + 1, 0) 
is constructible (Exercise 20). Hence, the circle with center A and radius 
(d + 1)/2 is constructible. The constructible line that is perpendicular 
to the x-axis at the point (1, 0) intersects this circle at the constructible 
point B = (1, y), as shown on the right of Figure 2. A theorem in plane 
geometry states that an angle that is inscribed in a semi-circle (such as 
OBD) is a right angle. Use the three right triangles on the right side 
of Figure 2 and the Pythagorean Theorem to show that y? = dand, 
therefore, y = Vd. It follows that y = Vd is a constructible number. m 
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Corollary 15.2 


Every rational number is constructible. 


Proof» Every integer is constructible (Exercise 4). Therefore, every quotient of a 
pair of integers (rational number) is constructible by Theorem 15.1. E 


In order to determine exactly which real numbers are constructible, we must examine 
the equations of constructible lines and circles. 


Lemma 15.3 


Let F be a subfield of the field R of real numbers. 


(1) Ifa line contains two points whose coordinates are in F, then the line 
has an equation of the form 


ax + by +c =0, where a, 6, cEF. 

(2) If the center of a circle is a point whose coordinates are in F and the 
radius of the circle is a number whose square is in F, then the circle 
has an equation of the form 

xX +y+uxtsy+ t=O, where r, s, tEF. 
Proof» (1) Suppose (x1, yı) and (x2, y3) are points on the line with x, y,¢ F. If 


X, # X, the two-point formula for the equation of a line shows that the 
line has equation 


bay 
=== 4 oe 
Ja LN eee i Tame) = 
Cie pa TE =) n] : 
ax + by + c =0 


Since F isa field and x, y; € F, each of æ, b, c is in F. The case when xı = x2 
is left to the reader. 


(2) If (xı yi) is the center and k the radius, with x, Yı, K” EF, then 
the equation of the circle is 


(x- a) + (y -yY = e 
x? + y? + (—2x,)x + (—2y,)y + [xt + y? — k = 0. 


The coefficients are in F. m 
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Lemma 15.4 
Let F be a subfield of R and k a positive element of F such that VkeF. Let 
F(Vk) be the set {a + Vk ja, b € F}. Then 
(1) F(VK) is a subfield of R that contains F. 
(2) Every element of F(Vk) can be written uniquely in the form a + bwk, 
witha, DEF. 
Proof» (1) Exercise 15. 


(2) Ifa+ bVk = a, + bi Vk, with a, b, a, b, EF, then a — a, = 
(bı — b) Vk. If b — b, + 0, then Vk = (a — a1) (bı — bX}, which is an 
element of F. This contradicts the fact that Vk ¢ F. Hence, b, — b, = 0, 
and, therefore, a — a, = (0) Vk =0.Thusa=a,andb=5,. Em 


The field F(VÝ is called a quadratic extension field of F. Quadratic extension fields 
play a crucial role in determining which numbers are constructible. 


Lemma 15.5 
Let F bea subfield of R. Let £; and Ł; be lines whose equations have coefficients 
in F. Let C, and C, be circles whose equations have coefficients in F. Then 

(1) If 4 intersects L, then the point of intersection has coordinates in F. 


(2) If Č, intersects C}, then the points of intersection have coordinates 
in F or in some quadratic extension field F(Vk). 


(8) If £; intersects C,, then the points of intersection have coordinates 
in F or in some quadratic extension field F(Wk). 


Proof» (1) Suppose L, and L, have equations 
Dy:ayx + by = cy 
Lax + by = c 


with a, bn cE F. Since L; intersects L,, these equations have a simulta- 
neous solution. By using elimination or determinants, we see that this 


solution is 
_ be — dyer _ AC, — Mti 
=a id 9 SS 
ab, — abi abı — ab, 
Since a,, b, cE F, the point of intersection (x, y) has coordinates in the 
field F. 


(2) Suppose C, and C, have equations 
CQ:irty+nx tay +t =0 
Curt y+ nx + sy +t, =0 
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with r; 5; t E€ F. The coordinates of the intersection points satisfy both 
equationsand, hence, must satisfy the equation obtained by subtracting 
the second equation from the first: 


(ry — ra)x + (Sy — Soy + (4, — fy) = 0. 
This is the equation of a line, and its coefficients are in F. Since the inter- 


section points of C, and C; lie on this line and on the circle C}, we need 
only prove (3) to complete the proof of the theorem. 


(3) Let L, and C, have the equations given above. At least one of 
4, bı must be nonzero, say b, # 0. Solve the equation of L, for y and 
substitute this result in the equation for C}. Verify that this leads to an 
equation of the form ax’ + bx + c = 0, witha, b, ce F. The solutions of 


this equation are 
re EE A, gk ie 


where A = —b/2a, B= 1 be and k = b? — dac are elements of F. Since 
L, and G; intersect, we know that k = 0. Using the equation for L4, we 
see that the coordinates of the points of intersection of ZL, and C; are 


Gr aA _ a,B 


x=A+BVK and y= Vie 


b bi 
x= A- BVk and yo L EN 
1 1 


If k = 0, these reduce to a single point of intersection. Since b, # 0, all 
these coordinates lie either in F (if Vk € F) or in the quadratic extension 


F(V&) (if Vk@F). m 
Theorem 15,6 


If a real number r is constructible, then there is a finite chain of fields 
Q =F, oF, cf,¢--+SF,¢Rsuch that reF, and each £ is a quadratic exten- 
sion of the preceding field, that is, 


F, = QV), Fa = F,(Ve;) Fy = FV Gq), «061 Fn = Fnk VEn) 
where c, E F but Væ ¢ fori =0,1,2,...,9-1. 


A finite chain of fields as in the theorem is called a quadratic extension chain. 


Proof of Theorem 15.6 > Let r be a constructible number. Then the point (r, 0) can 
be constructed from the points O = (0, 0) and P = (1, 0) by a finite 
sequence of operations of the following types: 


(i) Form the line determined by A and B, where A, B are previously 
constructed points or elements of {O, P}; 
(ii) Form the circle with center A and radius the distance from B to C, 
where A, B, C are previously constructed points or elements of {O, P}; 
(iii) Determine the points of intersection of lines and circles formed in 
(i) and (ii). 
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This process begins with the points O and P whose coordinates are in Q. 
Lines or circles determined by them will have equations with rational 
coefficients by Lemma 15.3. The intersections of such lines and circles 
will be points whose coordinates are either in Q or in some quadratic 
extension Q(Veq) by Lemma 15.5. The lines and circles determined by 
these points will have equations with coefficients in the field F} = Q( Vc) 
by Lemma 15.3. The intersections of such lines and circles will have 
coefficients either in F, or in some quadratic extension F,(Ve,) by 
Lemma 15.5. Continuing in this fashion, we see that at each stage of the 
construction of (r, 0) the points in question have coordinates in some 
field F, and at the next stage the newly created points have coordinates in 
F, or in a quadratic extension F({Vc). After a finite number of such steps 
we reach the point (r, 0), which necessarily has coordinates in the last 
field of the quadratic extension chain Q = WCF, SFS SF, E 


Roots of Polynomials 


There are two ways to show that some real numbers are not constructible. The method 
presented here is elementary and depends only on Chapter 4. But if you’ve covered 
Sections 11.1 and 11.2, skip to Theorem 15.9 and use the footnote below in place of 
the proof given there.* 


Lemma 15,7 


Let F be a subfield of R and f(x)eF{x]. Suppose that keF but Wk ¢F. 
If a + bvk is a root of f(x), then a — bvk is also a root of f(x). 


Proof» if u=r+sVkEF (Vk), let u denote r — sk. This operation is well 
defined because every element of F( Vk) can be written uniquely in the 
form r + sVk(r, se F) by Lemma 15.4. Verify that for any u, v e F( Vk), 
(u + v) = u + vand w = u ° v. Also note that u = u if and only if s = 0, 
that is, if and only if u € F. The rest of the proof is identical to the 
proof of Lemma 4.29, which is the special case when F = R, k = —1, 
and Vk =i. m 


Lemma 15.8 


Let F be a subfield of a field K. Let f(x), g(x) e Fix] and A(x) EAP]. If f(x) = 
g(x)h(x), then A(x) is actually in F[x]. 


*IfkeF and Vee F, then x? — k € F[x] is the minimal polynomial of Vk over F, and, hence, 
IE(VK):F] = 2 by Theorem 11.7. If Q S- - - & F, is a quadratic extension chain, then [F,:Q] must be 
a power of 2 by Theorem 11.4. Therefore, the minimal polynomial of a constructible number u has 
degree 2 for some k (since this degree is the dimension [Q(u):Q], which must divide [F,,:Q]). 
Consequently, no constructible number can be the root of an irreducible cubic in Q[x]. Since a 
cubic polynomial in Q[x]} with no rational roots is irreducible by Corollary 4.19, no such polynomial 
can have a constructible number as a root. 
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Proof» By the Division Algorithm in F[x], there are polynomials k(x) and r(x) 
in F[x] such that f(x) = g(x)k(x) + r(x), with r(x) = 0 or 
deg r(x) < deg g(x). Since F & K, all these polynomials are in K [x]. Now 
consider the Division Algorithm in K[x], which says that there is a unique 
quotient and remainder. We have f(x) = g(x)k(x) + r(x), and by hypoth- 
esis we also have f(x) = g(x)h(x) + 0. By uniqueness, we must have 
r(x) = 0 and A(x) = k(x). Since k(x) € F[x], the lemma is proved. E 


Theorem 15.9 


Let f(x) be a cubic polynomial in Qix]. If f(x} has no roots in Q, then f(x) has no 
constructible numbers as roots. 


The theorem implies, for example, that WZ is not a constructible number because it is 
a root of x’ — 2, which has no rational roots by the Rational Root Test (Theorem 4.21). 


Proof of Theorem 15.9> Suppose on the contrary that f(x) has real roots that are 
constructible. Each such root lies in a quadratic extension chain of Q by 
Theorem 15.6. Among all the quadratic extension chains containing a 
root of {(x), choose one of the smallest possible length, say Q = F S 
F S» -S Fa This means that f(x) has a root rin F, and that no qua- 
dratic extension chain of length n — 1 or less contains any root of 
fœ). Note that F, + Q since f(x) has no rational roots. By the Factor 
Theorem 4.16 f(x) = (x — r)t(x) for some t(x) € F,[x]. Now r € F,, and 
by the definition of a quadratic extension chain F, = F,_s(Vk) for some 
keF,_, with Vk € F,_;. Therefore r = a + bVk with a, bE F,_}. We 
must have b # 0; otherwise, r would be in the chain FoS F, S++: CF, y, 
contradicting the fact that f(x) has no roots in a chain of length n — 1. 
By Lemma 15.7 7 = a — bV<kis also a root of f(x) = (x — r(x). Since 
7 + r (because b # 0)7 must be a root of (x). By the Factor Theorem 


f(x) = (x — N(x — AACx) for some A(x) E€ Fx]. 
Let g(x) = (x — r)(x — r) and observe that the coefficients of g(x) are in 


al 


a(x) = (x — (a + bVK))(x — (a — bVK)) = 2 - 2ax + (@ — kb». 
Therefore, f(x) = g(x)h(x) with f(x), (x) E F,_:[x]. Consequently, 
A(x) F,_ [x] by Lemma 15.8. Now f(x) has degree 3 and g(x) has 
degree 2, so A(x) must have degree 1 by Theorem 4.2. Since every first 
degree polynomial over a field has a root in that field, A(x)—and, hence, 
f(x)—has a root in F,_}. This contradicts the choice of A SFS + > SF, 
as a quadratic extension chain of minimal length containing a root of f(x). 
Therefore, f(x) has no constructible numbers as roots. W 


Impossibility Proofs 


Finally, we are in a position to prove the impossibility of the constructions discussed 
at the beginning of the chapter. In what follows, it is assumed that whenever a point, 
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line radius, etc., may be chosen arbitrarily, a constructible point, line, radius, etc., will 
be chosen. This guarantees that all points, lines, etc., produced by the construction 
process will be constructible ones. 


DUPLICATION OF THE CUBE Label the endpoints of one edge of the 
given cube as O and P and use this edge OP as the unit segment for coordinatizing 
the plane. Since the given cube has side length 1, its volume is also 1. If there were 
some way to construct with straightedge and compass the side of a cube of volume 2, 
then the length ¢ of this side would be a constructible number such that c? = 2. 
Thus ¢ would be a root of x’ — 2. But this polynomial has no rational roots by 
the Rational Root Test and, hence, no constructible ones by Theorem 15.9. This 
contradiction shows that duplication of the cube by straightedge and compass is 
impossible, 


TRISECTION OF THE ANGLE It suffices to prove that an angle of 60° 
cannot be trisected by straightedge and compass. Choose two points O, P and 
coordinatize the plane with O as origin and P = (1, 0). The point Q = (1/2, 3/2) 
is constructible since its coordinates are constructible numbers by Theorem 15.1 and 
Corollary 15.2. Furthermore, Q lies on the unit circle xX? + y? = 1. Therefore, angle 
POQ has cosine 1/2 (the first coordinate of Q) and, hence, has measure 60°. If it were 
possible to trisect this angle with straightedge and compass, there would be a 
finite sequence of constructions that would result in a constructible point R such that 
the angle ROP has measure 20°, as shown in Figure 3. 


FIGURE 3 


The point T where the constructible line OR meets the constructible unit circle is 
a constructible point. Hence, its first coordinate, which is cos 20°, is a constructible 
number. Therefore, 2 cos 20° is a constructible number by Theorem 15.1. But for any 
angle of t degrees, elementary trigonometry (Exercise 5) shows that 


cos 3t = 4 cos? t — 3 cos t. 
If ¢ = 20°, then this identity becomes 


cos 60° = 4 cos? 20° ~ 3 cos 20° 


l s 
Plas 4 cos? 20° — 3 cos 20°. 
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Multiplying by 2 and rearranging, we have 
(2 cos 20°)? — 3(2 cos 20°) — 1 = 0. 


Thus the supposedly constructible number 2 cos 20° is a root of xX? — 3x — 1. The 
Rational Root Test shows that his polynomial has no rational roots and, hence, no 
constructible ones by Theorem 15.9. This is a contradiction. Therefore, an angle of 60° 
cannot be trisected by straightedge and compass. 


E Exercises 


A. 1. Prove that ris a constructible number if and only if ~r is constructible. 


2. Let a, b be constructible numbers. Prove that a + banda — b are 
constructible. 


3. Use straightedge and compass to construct an angle of 


(a) 30° (b) 45° 
(c) Show that angles of 90° and 45° can be trisected with straightedge and 
compass. 


4. Prove that every integer is a constructible number. [Hint: 1 is constructible 
(Why?): construct a circle with center (1, 0) and radius 1 to show 2 is 
constructible.] 


5. Prove that cos 3t = 4 cos? t — 3 cos t. [Hint: These identities may be helpful: 
(1) cos(t, + t) = cos ¢ cos t, ~ sin t, sin tz; (2) cos 2t = 2 cos? t — | and 
sin 2¢ = 2 sin t cos t; (3) sin? t + cos? t = 1] 

6. Is it possible to trisect an angle of 3t degrees if cos 3t = 1/3? What if 
cos 3¢ = 11/16? 


B. 7. Consider a rectangular box with a square bottom of edge x and height y. 
Assume the volume of the box is 3 cubic units and its surface area is 7 square 
units. Can the edges of such a box be constructed with straightedge and 
compass? 


8. Use straightedge and compass to construct a line segment of length 1 + V3, 
beginning with the unit segment. 


9. Is it possible to construct with straightedge and compass an isosceles triangle 
of perimeter 8 and area 1? 


10. (a) Prove that the sum of two constructible angles is constructible. 
[A constructible angle is an angle whose sides are constructible lines.] 


(b) Prove that it is impossible to construct an angle of 1° with straightedge 
and compass, starting with the unit segment. [Hint: If it were possible, 
what could be said about an angle of 20°77] 


11. Prove that an angle of t degrees is constructible if and only if cos tis a 
constructible number. 
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12. 


13. 


16. 


17. 


18. 


19. 


20. 


C21. 
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Prove that r is a constructible number if and only if a line segment of length 
|r[can be constructed by straightedge and compass, beginning with a segment 
of length 1. 


Let a, c be constructible numbers with c + 0. Prove that a/c is constructible. 
[Hint: The case when a > 0, ¢ > 0 was done in the proof of Theorem 15.1.] 


. Prove that the set of all constructible numbers is a field. 
. Let F be a subfield of R and k € F. Prove that (Vk) = {a + bVkla, be F} is 


a subfield of C that contains F. If k > 0, show that F is a subfield of R. 
[Hint: Adapt the hint for Exercise 39 in Section 3.1.] 


Prove the converse of Theorem 15.6: If r is in some quadratic extension chain, 
then r is a constructible number. [Hint: Theorem 15.1 and Corollary 15.2.] 


Let C be a constructible point and L a constructible line. Prove that the line 
through C perpendicular to L is constructible. [Hint: The case when C is on 
L was done in Example 1. If C is not on Land D is a constructible point on 
L, the circle with center C and radius CD is constructible and meets L at the 
constructible points D and E. The circles with center D, radius CD and center 
E, radius CE intersect at constructive points C and Q. Show that line CQ is 
perpendicular to L.] 


Prove that (r, s) is a constructible point if and only if r and s are constructible 
numbers. [Hint: The lines through (r, s) perpendicular to the axes are 
constructible by Exercise 17.] 


Let A be a constructible point not on the constructible line L. Prove that the 
line through A parallel to L is constructible [Hint: Use Exercise 17 to find a 
constructible line M through A, perpendicular to L. Then construct a line 
through A perpendicular to M.] 


Prove that the midpoint of the line segment between two constructible points 
is a constructible point. [Hint: Adapt the hint to Exercise 17.] 


Squaring the Circle Given a circle of radius r, show that it is impossible to 
construct by straightedge and compass the side of a square whose area is the 
same as that of the given circle. You may assume the nontrivial fact that 77 is 
not the root of any polynomial in Q[x]. 


CHAPTER 16 


Algebraic Coding Theory 


Prerequisites: Section 7.4 and Appendix F for Section 16.1; Section 8.4 
for Section 16.2; Section 11.6 for Section 16.3. 


Coding theory deals with the fast and accurate transmission of messages over 
an electronic “channel” (telephone, telegraph, radio, TV, satellite, computer relay, 
etc.) that is subject to “noise” (atmospheric conditions, interference from nearby 
electronic devices, equipment failures, etc.). The noise may cause errors so 
that the message received is not the same as the one that was sent. The aim of 
coding theory is to enable the receiver to detect such errors and, if possible, to 
correct them.* 

The use of abstract algebra to solve coding problems was pioneered by 
Richard W. Hamming, whose name appears several times in this chapter. In 1950 
he developed a large class of error-correcting codes, some of which are presented 
here. 


16.1 Linear Codes 


Verbal messages are normally converted to numerical form for electronic transmis- 
sion. When computers are involved, this is usually done by means of a binary code, 
in which messages are expressed as strings of 0’s and I’s. Such messages are easily 


“Thus coding theory has virtually no connection with the secret codes discussed in Chapter 13. 
The purpose of the latter was to conceal the message, whereas the purpose here is to guarantee 
its clarity. 
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handled because the internal processing units on most computers represent letters, 
numerals, and symbols in this way. The discussion here deals only with such binary 
codes.* 

Throughout this chapter we assume that we have a binary symmetric channel, 
meaning that: 


1. The probability of a 0 being incorrectly received as a 1 is the same as the 
probability of a 1 being incorrectly received as a 0; 


2. The probability of a transmission error in a single digit is less than .5; and 


3. Multiple transmission errors occur independently.t 


Here is a simple example that gives a flavor of the subject. 


EXAMPLE 1 


Suppose that the message to be sent is a single digit, either 1 or 0. The mes- 
sage might be, for example, a signal to tell a satellite whether or not to orbit a 
distant planet. With a single-digit message, the receiver has no way to tell if an 
error has occurred. But suppose instead that a four-digit message is sent: 1111 
for 1 or 0000 for 0. Then this code can correct single errors. For instance, if 
1101 is received, then it seems likely that a single error has been made and that 
1111 is the correct message. It’s possible, of course, that three errors were made 
and the correct message is 0000. But this is much less likely than a single error. 
The code can detect double errors, but not correct them. For instance, if 1100 
is received, then two errors probably have been made, but the intended message 
isn’t clear. 


Example 1 illustrates in simplified form the basic components of coding theory. 
The numerical message words (0 and 1) are translated into codewords (0000 and 1111). 
Only codewords are transmitted, but in the example any four-digit string of 0’s and 1’s 
is a possible received word. By comparing received words with codewords and decid- 
ing the most likely error, a decoder detects errors and, when possible, corrects them.** 
Finally, the corrected codewords are translated back to message words, or an error is 
signaled for received words that can’t be corrected. 

Now consider Example 1 from a different viewpoint. Think of the message words 0 
and 1 as elements of Z>, and the received words as the additive group Z3 X Z3 X Z3 X Z3 
(with its elements written as 4-digit strings of 0’s and 1’s). Using Theorem 7.12, you 


*“Binary” refers to the fact that these codes are based on Z,. Although binary codes are the most 
common, other codes can be constructed by using any finite field in place of Z- 

1The accuracy rate of message transmission depends on these probabilities. Since elementary 
probability is not a prerequisite for this book, our discussion of such questions will be minimal; see 
Exercises 27-31. 

Sif the probability of receiving a wrong digit is .01, then three or four errors occur ina message word 
less than .0004% of the time (once in 250,000 transmissions); see Exercise 27. 


**This is sometimes called maximum-likelihood decoding. 
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can easily verify that the set of codewords {0000, 1111} is asubgroup of order 2 of the 
received words, as shown schematically here: 


Message Words Codewords Received Words 
Z Z, X Z, X Z, X Z, 
0 0000 
1111 


Next, we extend these ideas to the general case. For each positive integer n, 
Bin) denotes Z, x Z, X Z, X +- X Z, (n copies). 


With coordinatewise addition, B(n) is an additive group of order 2” (Exercise 10). The 
elements of B(n) will be written as strings of 0’s and 1’s of length x. 


Definition lf 0-< k <n, then an (n, k) binary linear code consists of a subgroup C of 


A(n) of order 2. 


For convenience, C is often called an (n, k) code, a linear code, or just a code.* The 
elements of C are called codewords. Only codewords are transmitted, but any element 
of B(n) can bea received word. 

The code in Example 1 is C = {0000, 1111}, a subgroup of order 2! of the group 
B(4) = Z, X Z, X Z, X Z, of order 24. So this is a (4, 1) code, in which the set of 
message words is B(1) = Z}. Similarly, in the general case of an (7, k) code, we shall 
consider B(k) = Z, X Z, X Z, X +- - X Z, (k copies of Z3), which has order % to be 
the set of message words. 

Although any method of assigning each message word to a unique code word can 
be used, the assignment made in Example | is convenient because the first digit in each 
code word is the corresponding message word: 0 —> 0000 and 1 — 1111. The (n, k) codes 
discussed below have the same feature: The first k digits of an n-digit codeword form 
the corresponding message word. 


EXAMPLE 2 


We shall construct the (6, 5) parity-check code. The message words are the ele- 
ments of B(5), that is, all five-digit strings of 0’s and 1’s. A message word is con- 
verted to a codeword (element of B(6)) by adding a sixth digit to the string; the 
extra digit is the sum (in Z3) of the digits in the message word. For instance, if 
the message word is 11011, then 1 + 1 +0 + 1 + 1 = 0, so the corresponding 
codeword in B(6) is 110110. Similarly, the message word 10101 € B(5) has 
1+0+1+0+ 1 = 1, so the corresponding codeword is 101011 € B(6). 

An element of B(6) is a codeword if and only if the sum of its digits is 
0. [Reason: If the sum of the message-word digits is 0, a 0 is added to make 
the codeword; if the sum of the message-word digits is 1, a 1 is added for the 


*Linear codes are also called block codes or group codes. 
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codeword and 1 + 1 = 0; see Exercise 12 for the converse.] Using this property, it 
is easy to show that the set C of codewords is a subgroup of B(6) (Exercise 13). 

This code can detect single transmission errors (1 is received as 0 or 0 as 1) 
because the sum of the digits in the received word is 1 instead of 0. The same 
is true for any odd number of errors. But it cannot detect an even number of 
errors, nor can it correct any errors. For each n = 2, an (n, n — 1) parity-check 
code can be constructed in the same way. 


When retransmission of messages is easy, a parity-check code can be very useful. 
Such codes are frequently used in banking and in the internal arithmetic of computers. 
But when retransmission is expensive, difficult, or impossible, an error-correcting code 
is more desirable. We now develop the mathematical tools for determining the number 
of errors a code can detect or correct. 


Definition The Hamming weight of an element u of A(n) is the number of nonzero 
coordinates in y; it is denoted Wt(u). 
EXAMPLE 3 
If u = 11011 in B(5), then Wt(z) = 4. Similarly, v = 1010010 € B(7) has weight 
3, and 0000000 has weight 0. 
Definition 


Let u, v eB(n). The Hamming distance between u and y, denoted cfu, v), is 
the number of coordinates in which u and v differ.* 


EXAMPLE 4 


If u = 00101 and v = 10111 in B(5), then du, v) = 2 because u and v differ in 
the first and fourth coordinates. In B(4) the distance between 0000 and 1111 is 4. 


Lemma 16.1 
If u, v, wEB(n), then 
(1) d(u, v) = Wt(u — v); 
(2) du, v) = d(u, w) + dw, v). 
Proof» (1) A coordinate of u — vis nonzero if and only if u and v differ in that 
coordinate. So the number of nonzero coordinates in u — v, namely 


Wt(u — v), is the same as the number of coordinates in which u and v 
differ, namely du, v). 


*In other words, if u = u,ug+--u, and Y = %4¥_°°> ¥, (with each u; v,either 1 or 0), then d(u, v) is the 
number of indices í such that u; + ¥; 
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(2) It suffices by (1) to prove that Wt(u — v) Ss Wt(u — w) + Wt(w — v). 
The left side of this inequality is the number of nonzero coordinates of 
u — v, and the right side is the total number of nonzero coordinates in 
u — w and w — v. So we need to verify only that whenever u — v has non- 
zero ith coordinate, at least one of u — w and w — v also has nonzero ith 
coordinate. Using the subscript i to denote ith coordinates, suppose the ith 
coordinate w — vof u — v is nonzero. If the ith coordinate u; — w, of 
u — w is nonzero, then there is nothing to prove. If u, — w, = 0, then 
u; = wp and, hence, w; — v; = w — v ¥ 0. Therefore, the ith coordinate 
wy — yof w — vis nonzero. W 


If a codeword u is transmitted and the word w is received, then the number of 
errors in the transmission is the number of coordinates in which u and w differ, that 
is, the Hamming distance from u to w. Since a large number of transmission errors is 
less likely than a small number (Exercise 27), the nearest codeword to a received word 
is most likely to be the codeword that was transmitted. Therefore, a received word is 
decoded as the codeword that is nearest to it in Hamming distance. If there is more 
than one codeword nearest to it, the decoder signals an error.* This process is called 
nearest-neighbor decoding.’ 


Definition A linear code is said to correct ¢ errors if every codeword that is trans- 
mitted with ¢ or fewer errors is correctly decoded by nearest-neighbor 


decoding. 


Theorem 16.2 


A linear code corrects ¢ errors if and only if the Hamming distance between 
any two codewords is at least 2t + 1. 


Proof > Assume that the distance between any two codewords is at least 2¢ + 1. 
If the codeword vu is transmitted with ¢ or fewer errors and received 
as w, then du, w) = t. If v is any other codeword, then du, v) = 2t + 1 
hypothesis. Hence, by Lemma 16.1, 


2t + 1 S dlu, v) S du, w) + dw, v) St + dw, v). 


Subtracting t from both sides of 2¢ + 1 = t + dw, v) shows that 

dw, v) = t + 1. Since du, w) = t, u is the closest codeword to w, so 
nearest-neighbor decoding correctly decodes w as u. Hence, the code 
corrects f errors. The proof of the converse is Exercise 15. E 


* Alternatively, the decoder can be programmed to choose one of the nearest codewords arbitrarily. 
This is usually done when retransmission is difficult or impossible. 

tUnder our assumptions in this chapter, nearest-neighbor decoding coincides with maximum- 
likelihood decoding. 
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Since only codewords are transmitted, errors are detected whenever a received 
word is not a codeword. 


Definition A linear céde is said to detect ¢ errors if the received word in any trans- 


mission with at least one, but no more than t errors, is not a codeword. 


Theorem 16.3 


A linear code detects ¢ errors if and only if the Hamming distance between 
any two codewords is at least £ + 1. 


Proof » Assume that the distance between any two codewordsis at least ¢ + 1. If 
the codeword u is transmitted with at least one, but not more than ¢ errors, 
and received as w, then 


0< du, w) £ t, and hence du, w) <t+ 1. 


So w cannot be a codeword. Therefore, the code detects ¢ errors. The 
proof of the converse is Exercise 16. E 


If u and v are distinct codewords, then d(u, v) is the weight of the nonzero code- 
word u — v by Lemma 16.1. Conversely, the weight of any nonzero codeword w is 
the distance between the distinct codewords w and 0 = 000 - - - 0 € B(x) because 
Wt(w) = Wt(w — 0) = dw, 0). Therefore, the minimum Hamming distance between any 
two codewords is the same as the smallest Hamming weight of all the nonzero codewords. 
Combining this fact with Theorems 16.2 and 16.3 yields. 


Corollary 16.4 


A linear code detects 2t errors and corrects £ errors if and only if the Hamming 
weight of every nonzero codeword is at least 2t + 1. 


EXAMPLE 5 


Let the message words be 00, 10, 01, 11 € B(2) and construct a (10, 2) code 
by assigning to each message word the codeword (element of B(10)) obtained 
by repeating the message word five times: 


0000000000, 1010101010, 0101010101, 1111111111. 


The set C of codewords is closed under addition and, hence, a subgroup of 
order 2? (Theorem 7.12). So Cis a (10, 2) code. Every nonzero codeword has 
Hamming weight at least 5 = 2+ 2 + 1. By Corollary 16.4 (with ¢ = 2), the 
code C corrects two errors and detects four errors. 
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By constructing codes that repeat the message words a large number of times (five 
in the last example), you can always guarantee a high degree of error detection and 
correction. The disadvantage to such repetition codes is their inefficiency when long 
messages must be sent. It is time consuming and expensive to transmit a large number 
of digits for each message word. So the goal is to construct codes that achieve an ac- 
ceptable accuracy rate without unnecessarily reducing the transmission rate. 

One efficient technique for constructing linear codes is based on matrix multipli- 
cation. Codes constructed in this way are automatically equipped with an encoding 
algorithm that assigns each message word to a unique codeword. 


EXAMPLE & 
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We shall construct a (7, 4) code. The message words will be the elements of 
B(4), and the codewords elements of B(7). Message words are considered as 
row vectors and converted to codewords by right multiplying by the following 
matrix, whose entries are in Z3: 


oof 


0 


Se O se 


Sr = (Oz 
Ke OC CO 


0 
1 
1 
1 


=. m O m 


Se 0o m.m 


For instance, the message word 1101 is converted to the codeword 1101001 because 


(1101) 


1 0 0 
0 1 0 
00 1 
00 0 


0 
0 
1 


1 
1 
0 
1 


=(1 


101001). 


The complete set C of codewords may be found similarly: 


Message Word Codeword 


Message Word 
0000 
0001 
0010 
0011 
0100 
0101 
0110 
0111 


Codeword 


0000000 
0001111 
0010110 
0011001 
0100101 
0101010 
0110011 
0111100 


1000 
1001 
1010 
1011 
1100 
1101 
1110 
1111 


1000011 
1001100 
1010101 
1011010 
1100110 
1101001 
1110000 
1111111 


Theorem 16.6 below shows that C is actually a subgroup of B(7). So C is a (7, 4) 
code, called the (7, 4) Hamming code. The preceding table shows that every nonzero 
codeword has Hamming weight at least 3 = 2 - 1 + 1. Hence, by Corollary 16.4 
(with ¢ = 1) this code corrects single errors and detects double errors. 


‘Crete 2012 Cones l amia ANR Ramet bay et ts viet eases or Anglica, in whate or a part Dus to elscronic dghts, mana third panty conta may de 
thet soy capprenend tant doas oot menmriafy affect the oraal laing experiama Omage Leaning <field 


ave ths right © aove 


mppmmai Gren the eBodk rador aC haptar(s). Edimwial review hes 
Coates of en tune if plagua rights restrictions Fequire it. 


478 Chapter 16 Algebraic Coding Theory 


The table in Example 6 shows that codewords in the Hamming (7, 4) code have a 
special form: The first four digits of each codeword form the corresponding message 
word. For instance, 7707001 is the codeword for 1101.* An (n, k) code in which the first 
k digits of each codeword form the corresponding message word is called a systematic 
code. All the examples above are systematic codes. Systematic codes are convenient 
because codewords are easily translated back to message words: Just take the first k 
digits. 

We can construct other systematic codes by following a procedure similar to that 
in the last example. A k X n standard generator matrix is a k X n matrix G with entries 


in £ of the form 
10 0 O a e Aek 
0 1 0 0 0 an “ss aidh 
Ci? a3 = (1 A), 
~ 0 O sra HO agent we GR 
~ Oo OO ox @ A Aki Bae Ak n—k 


where J, is the k X k identity matrix and A is ak X (n — k) matrix. For instance, the 
matrix G in Example 6 is a 4 X 7 standard generator matrix. It has the form (J, | 4), 
where A is a 4 X 3 matrix. 

A standard generator matrix can be used as an encoding algorithm to convert ele- 
ments of B(k) into codewords (elements of B(z)) by right multiplication. Each u € B(k) 
is considered as a row vector of length k. The matrix product uG is then a row vector 
of length x, that is, an element of B(x). Because the first k columns of G form the 
identity matrix Xg, the first k coordinates of the codeword uG form the corresponding 
message word u E B( k) (Exercise 23). In order to justify calling uG a “codeword,” we 
must show that the set of all such elements is a subgroup of B(n). 


Lemma 16.5 


If f:B(k) — B(n) is an injective homomorphism of groups, then the image of f 
is an (a, k) code. 


Proof » 1m fis a subgroup of B(n) that is isomorphic to B(k) by Theorem 7.20. 
Therefore, Im f has order 2* and, hence, is an (n, k) code. E 


Theorem 16.6 


If G isa k X n standard generator matrix, then {uG | u E&(k)} is a systematic 
(n, k} code. 


Proof » Define a function fB(k) = B(n) by f(u) = uG. The image of f is 
{f(u) | ue B(k)} = {uG | ue B(k)}. By Lemma 16.5 and the italicized 


*The last three digits of each codeword are check digits that can be used to determine if a 
received word is a codeword; see Exercise 22. 
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remarks preceding it, we need to show only that fis an injective homo- 
morphism of groups. Since matrix multiplication is distributive, 


Sut v) = (ut v)G = uG + vG = f(u) + f(v). 


Hence, f is a homomorphism of groups. 

If u = ugu, ++ + ug E B(k), then the first k coordinates of uG are 
uju ` * * up because Gis a standard generator matrix, and similarly for 
v = vyu * ++ vg E BK). We use this fact to show that f is injective. If 


S(u) = f(v), then in B(x) 

ujug * ** Uy seeae = UG = f(u) = f(v) = vG = vv, + * + Uy HHEREK, 
where the +’s indicate the remaining coordinates of uG and vG. Since 
these elements of B(n) are equal, they must be equal in every coordinate. 


In particular, u) = 2%, uz = Uz - » . , Ug = vy. Therefore, u = v in B(k), 
and fis injective. W 


EXAMPLE 7 


By Theorem 16.6, the standard generator matrix 


generates the (6, 3) code {uG | u € B(3)}. Verify that the encoding algorithm 
u — uG produces these codewords: 


Message Word Codeword Message Word Codeword 
000 000000 100 100011 
001 001110 101 101101 
010 010101 110 110110 
011 011011 111 111000 


Since the Hamming weight of every nonzero codeword is at least 3, this code 
corrects single errors and detects double errors by Corollary 16.4 (with ¢ = 1). 


Describing a large code by means by a standard generator matrix is much more 
efficient than listing all the codewords. For instance, in a (50, 30) code there are only 
1500 entries in the 30 X 50 generator matrix, but more than a billion codewords. 

Linear algebra can be used to show that every systematic linear code is given by 
a standard generator matrix. The standard generator matrices for the codes in the 
examples above are in Exercises 7-9. 
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E Exercises 


A. 


= 


. Show that C = {0000, 0101, 1010, 1111} is a (4, 2) code. 
. Find the Hamming weight of 

(a) 0110110€ B(7) (b) 11110011 € B(8) 

(c) 000001 €B(6) (d) 101101101101 € B(12) 
. Find the Hamming distance between 

(a) 0010101 and 1010101 

(b) 110010101 and 100110010 

(c) 111111 and 000011 

(d) 00001000 and 10001000 


4. Use nearest-neighbor decoding in the Hamming (7, 4) code to detect errors 
and, if possible, decode these received words: 


N 


w) 


(a) 0111000 (b) 1101001 
(c) 1011100 (d) 0010010 
5. List all codewords generated by the standard generator matrix: 
1000 101 1 
b 
BiG otal @io pate 
1001 loo Lt w 
{0101 Gy | @ ot oa 
0010 001110 


6. Determine the number of errors that each of the codes in Exercise 5 will 
detect and the number of errors each will correct. 


7. Show that the standard generator matrix 


100001 
010001 
G=|0 010041 
000101 
00001 1 


generates the (6, 5) parity-check code in Example 2. [Hint List all the 
codewords generated by G; then list all the codewords in the parity-check 
code; compare the two lists] 
8. Show that the standard generator matrix 
1 010 1 0 
G — 
fs 1 0 1 0 y 
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generates the (10, 2) repetition code in Example 5. [Hint: See the hint for 
Exercise 7.] 


9. Show that 1 X 4 standard generator matrix (1 1 1 1) generates the code in 
Example 1. 


10. Prove that B(n) = Z, X Z, X Z, X +++ X Z, (n factors) with coordinatewise 
addition is an abelian group of order 2”. 


B. 11. Prove that for any u, v, w E B(n), 
(a) du, v) = dv, u). 
(b) du, v) = 0 if and only if u = v. 
(c) du, v) = du + w, v + w). 
12. Prove that an element of B(6) is a codeword in the (6, 5) parity-check code 


(Example 2) if the sum of its digits is 0. [Hint- Compare the sum of the first 
five digits with the sixth digit.] 


13. Prove that the set of all codewords in the (6, 5) parity-check code (Example 2) 
is a subgroup of B(6). [Hint: Use Exercise 12.] 

14. If u and v are distinct codewords of a code that corrects ¢ errors, explain why 
Ku, v) =t. 


15. Complete the proof of Theorem 16.2 by showing that if a code corrects £ 
errors, then the Hamming distance between any two codewords is at least 
2t + 1. [Hint: If u, v are codewords with d(u, v) £ 2t, obtain a contradiction 
by constructing a word w that differs from u in exactly t coordinates and from 
vin t or fewer coordinates; see Exercise 14.] 


16. Complete the proof of Theorem 16.3 by showing that if a code detects ¢ errors, 
then the Hamming distance between any two codewords is at least f + 1. 


17. Construct a (5, 2) code that corrects single errors. 
18. Show that no (6, 3) code corrects double errors. 


19. Construct a (7, 3) code in which every nonzero codeword has Hamming 
weight at least 4. 


20. Is there a (6, 2) code in which every nonzero codeword has Hamming weight 
at least 4? 


99 66, 


21. Suppose only three messages are needed (for instance, “go,” “slow down,” 
“stop”). Find the smallest possible n so that these messages may be 
transmitted in an (z, k) code that corrects single errors. 


22. Let G be the standard generator matrix for the (7, 4) Hamming code in 
Example 6. 


(a) If u = (ui, tz, u3, u4) is a Message word, show that the corresponding 
codeword uG is 


(uy, Ug, Ug, Uy, Uz F Uy F ty, Uy + try + ug, Uy + Ly + ug). 
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(b) If v = (v, v, v3, V4, Us, Vés V7) E B(T), show that v is a codeword if and 
only if its last three coordinates (the check digits) satisfy these equations: 
Us = U2 + Vy + V4 
Vg = V + v3 + vA 
u= u + v2 + VA 


23. If G is a k X n standard generating matrix and u = uyuzuy * * * ug is a message 
word, show that the first k digits of the codeword uG are uy, Uz, - «+» Me 


24. If Cis a linear code, prove that either every codeword has even Hamming 
weight or exactly half of the codewords have even Hamming weight. 


25. Prove that the elements of even Hamming weight in B(n) form an (n, n — 1) 
code. 


26. If k < n and f:B(k) > B(n) is a homomorphism of groups, is Im f'a linear 
code? Is Im fan (n, k) linear code? 


NOTE: A knowledge of elementary probability and a calculator are needed for 
Exercises 27-31. 


27. Assume that the probability of transmitting a single digit incorrectly is .01 
and that a four-digit codeword is transmitted. Construct a suitable probability 
tree and compute the probability that the codeword is transmitted with 


(a) no errors; (b) one error; 
(c) two errors; (d) three errors; 
(e) four errors; (f) at least three errors. 


28. Do Exercise 27 for a five-digit codeword. 


29. Suppose the probability of transmitting a single digit incorrectly is greater 
than .$. Explain why “inverse decoding” (decoding 1 as 0 and 0 as 1) should 
be employed. 

30. Assume that the probability of transmitting a single digit incorrectly is .01 
and that M isa 500-digit message. 


(a) What is the probability that M will be transmitted with no errors? 


(b) Suppose each digit is transmitted three times (111 for each 1, 000 for 
each 0) and that each received digit is decoded by “majority rule” (111, 
110, 101, 011 are decoded as 1 and 000, 001, 010, 100 as 0). What is 
the probability that the message received when M is transmitted will be 
correctly decoded? [Hint: Find the probability that a single digit will be 
correctly decoded after transmission.] 


31. (a) Show that the number of ways that & errors can occur in an n-digit 
message is (al where f is the binomial coefficient. 


(b) If p is the probability that a single digit is transmitted incorrectly and q is 
the probability that it is transmitted correctly, show that the probability 


that k errors occur in an n-digit message is (7) rr ga 
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16.2 Decoding Techniques 


Nearest-neighbor decoding for an (7, k) code was implemented in Section 16.1 by 
comparing each received word with all 2* codewords in order to decode it. But when k 
is very large, this brute-force technique may be impractical or impossible. So we now 
develop decoding techniques that are sometimes more efficient. One of them is based 
on groups and cosets. 


EXAMPLE 1 


Let C be the (5, 2) code {00000, 10110, 01101, 11011}. From the elements of 
B(5) not in C, choose one of smallest weight (which in this case is weight 1), say 
= 10000. Form its coset e, + C by adding e, successively to the elements of 

Cand list the coset elements, with e, + c directly below r for each cE C: 


Ç: 00000 | 10110 01101 11011 
e, + C: 10000 | 00110 11101 01011 


Thus, for example, 11101 is directly below 01101 € C because e, + 01101 = 10000 + 
01101 =.11101. Among the elements not listed above, choose one of smallest weight, 
say e, = 01000, and list its coset in the same way (with e, + c belowc € C): 


Cc: 00000 | 10110 01101 11011 
e + C: 10000 | 00110 11101 01011 
eat C: 01000! 11110 00101 10011 


Among the elements not yet listed, choose one of smallest weight and list its coset, 
and continue in this way until every element of B(5) is on the table. Venfy that this is 
acomplete table: 


00000 | 10110 01101 11011 Codewords 
10000 | 00110 11101 01011 

01000 | 11110 00101 10011 

00100 | 10010 01001 11111 Received Words 
00010 | 10100 01111 11001 

00001 | 10111 01100 11010 

11000 | 01110 10101 00011 

10001 | 00111 11100 01010 


The decoding rule (which will be justified below) is: Decode a received word w as the 
codeword at the top of the column in which w appears. For instance, 01001 (fourth row) 
is decoded as01101;and 01010 (last row) is decoded as 11011. Similarly, 11000 
(seventh row) is decoded as 00000. 


The decoding table in the example is called a standard array, and the decoding rule 
standard-array decoding or coset decoding. The same procedure can be used to con- 
struct a standard array for any code C. Its rows are the cosets of C, with C itself as the 
first row. Each is of the form e + C, where e is the coset leader (an element of smallest 
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weight in the coset and listed first in the row). The element e + ¢ (with cE C) is listed 
in the column below c and is decoded as c. 


Theorem 16.7 


Let C be an (n, k) code. Standard-array decoding for C is nearest-neighbor 
decoding. 


Proof » 1f wE B(n), then w = e + vee + C, where eis a coset leader and vis 
the codeword at the top of the column containing w. Standard-array 
decoding decodes w as v. We must show that v is a nearest codeword to 
w. If u €C is any other codeword, then w — u is an element of w + C. 
But w + C is the coset of e (because e = w—vew + C). By construc- 
tion, the coset leader e has smallest weight in its coset, so Wt(w — u) = 
Wi(e). Therefore, by Lemma 16.1, 


d(w, u) = Wt(w — u) = Wt(e) = Wt(w—v) = Aw, v). 
Thus v is a nearest codeword tow. W 


When nearest-neighbor decoding is implemented by a standard array, a codeword is 
automatically chosen whenever there is more than one codeword that is nearest to a re- 
ceived word w (rather than an error being signaled). So incorrect decoding may occur in 
such cases. The code in the last example corrects single errors (every codeword has weight 
at least 3; see Corollary 16.4). Since two or more errors are much less likely than a single 
one, standard-array decoding for this code has a high rate of accuracy (Exercise 18). 

Once a standard array has been constructed, it’s much more efficient for decoding 
than brute-force comparison with all codewords. Unfortunately, constructing a stan- 
dard array for a large code may require as much computer time and memory as brute 
force. But when a code is given by a generator matrix, a much shorter decoding array 
is possible, as we now see. 

Consider an (n, k) code with k x n standard generator matrix G = (X | A). The 


A 
parity-check matrix of the code is the n x (n — k) matrix H = E) 
an —k. 


EXAMPLE 2 


Verify that the standard generator matrix for the (5, 2) code {00000, 10110, 
01101, 11011} of Example 1 is 


ft & 1 & Me 
e=(' 110 t) = Gal). 


*Since the generator matrix can always be obtained from the parity-check matrix, many books on coding 
theory define a code in terms of its parity-check matrix rather than its generator matrix. In most books, 
the parity-check matrix is defined to be the transpose of our matrix H, that is, the (k — n) X n matrix 
whose ith row is the same as the /th column of H. The matrix H is more convenient here, and, in any case, 
all theresults are easily translated from one notation to the other. 
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Here k = 2, n = 5, n — k = 3, and A is 2 x 3. So the parity-check matrix is the 
5 x3 matrix 


Il 
O O m m m 
O m= O O m 


re OOrO 
I) 
m~ 
um | he 
SS 


Verify that the product matrix GH is the 2 x 3 zero matrix. The phenomenon 
occurs in the general case as well. 


Lemma 16.8 
f G = (|A) is the standard generator matrix for a linear code and 
H= is its parity-check matrix, then GH is the zero matrix. 


n-k 


Proof > The entry in row i and column j of GH is the product of the ith row of G 
(see page 478) and the jth column of H:* 


(nôn ° ++ Sy + * Oganan *Qy*** Quy) 


nny 
= byay + Spay + +++ + yay +++ + + Snag 
+ ay8y + aoðy +--+ ayy + E + Arn- jn- 


Since 5,, = 0 whenever r ¥ s and since addition is in Z3, this sum reduces to 
Öudy + aydy = lay + ayl = ay +a =0. E 


In an (n, k) code with k x n standard generator matrix G, every received word 
w E B(n) is a row vector of length n. Since the parity-check matrix H is n x (n — k), 


*The Kronecker delta symbol 8,, is defined as follows: when r = S, Ôm = 1 and when r £s, ,, = 0. 
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the product wH is a row vector of length n — k, that is, an element of B(n — k). Let 0 
denote 000 « - - 0E B(n — k). 


EXAMPLE 3 


Let H be the 5 x 3 parity-check matrix for the (5, 2) code in Example 2. Then 
11000H = 011 and 10110H = 0: 


(11000) =(011) and 


oe 
oroc$cr 
eocro 


(10110) = (00 0). 


O O m m m 
oro oO 
Poor oso 


The fact that 10110 is a codeword in this code and 10110H = 0 is an example of 
the following Theorem. 


Theorem 16,9 


Let C be an (n, k) code with standard generator matrix G and parity-check 
matrix H. Then an element w in &(n) is a codeword if and only if wH = Q. 


Proof » Define a function f-B(n) > B(n- k) by f (w) = wH. Then fis a homo- 
morphism of groups (same argument as in the proof of Theorem 16.6). 
Now w is a codeword if and only if we C. Also, we K (the kernel of f) 
if and only if wH = 0. So we must prove that w€ Cif and only if w EK, 
that is, that C = K. By the definition of generator mateix, every element 
of Cis of the form uG for some u E B(k). But (uG)H = uw(GH) = 
because GH is the zero matrix (Lemma 16.8). Therefore, C& K. Since 
Cis a subgroup of order 2*, we need to show only that K has order 2* in 
order to conclude that C = K. 

Exercise 14 shows that fis surjective. By the First Isomorphism 
Theorem 8.20, B(n — k) = B(n)/K, and, hence, by Lagrange’s Theorem 8.5, 


= |B(n)| = |K| [B():K] 
= |K] + |B0)/K] = |K] + |Bín — K) = |K]: 2. 


Dividing the first and last terms of this equation by 2"—* shows that 
k 
|K|=2°. m 
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Corollary 16.10 


Let C be a linear code with parity-check matrix H and let u, ve 8(n). Then u 
and v are in the same coset of C if and only if vH = vH. 


Proof » To say that u and v are in the same coset means u + C = v + C. 
Theorem 8.2 in additive notation shows that 
u+C=uv+C if and only if u — vEC. 
By Theorem 16.9, 
u—veEC ifandonlyif (u — v)H =0. 


Since matrix multiplication is distributive, (u — v)H = uH — vH. Also, 
uH — vH = Ois equivalent to uH = vH. Hence, 


(u-—vH=0 if and only if uH = vH. 
Combining the three centered statements above proves the theorem. W 


If we Bin) and H is the parity-check matrix, then wH is called the syndrome of w. 
By Corollary 16.10, w and its coset leader e have the same syndrome. If w = e + v with 
v EC, the standard array decodes w, as v = w — e. Therefore, standard-array (nearest- 
neighbor) decoding can be implemented as follows: 


1. If w is a received word, compute the syndrome of w (that is, wH). 
2. Find the coset leader e with the same syndrome (that is, eH = wH). 
3. Decode was w — e. 


Since this procedure (called syndrome decoding) requires only that you know the syn- 
dromes of the coset leaders, the standard array can be replaced by a much shorter table. 


EXAMPLE 4 


The coset leaders for the (5, 2) code {00000, 10110, 01101, 11011}, as shown in 
Example 1, are 


00000, 10000, 01000, 00100, 00010, 00001, 11000, 10001. 


Multiplying each of them by the parity-check matrix H given in Example 2 produces 
its syndrome: 
Syndrome 000 110 101 100 O10 001 01l 111 
Coset Leader | 00000 10000 01000 00100 00010 000001 11000 10001 


To decode w = 01001, for example, we compute 01001 H = 100. The table shows 
that the coset leader with this syndrome is e = 00100. So we decode w as w — e = 
01001 — 00100 = 01101. 


Depending on the size of the code and whether or not coset leaders can be 
determined without constructing the entire standard array, syndrome decoding may 
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be more efficient than brute-force nearest-neighbor decoding. For example, a (56, 48) 
code has 2 (approximately 2.8 x 10'*) codewords but only 2° = 256 cosets. 

Standard-array and syndrome decoding are complete decoding schemes, meaning 
that they always find a nearest codeword for each received word. When retransmission 
of the message is impractical, complete decoding is a necessity. But when retransmis- 
sion is feasible, it may be better to use an incomplete decoding scheme that corrects 
t errors and requests retransmission when more than £ errors are detected. We now 
describe one such scheme. 

Let e,€ B(n) denote the row vector with 1 in coordinate i and 0 in every other 
coordinate. In B(3), for instance, e} = 100, e, = 010, and e; = 001. Each e, has weight 1; 
in fact 


Ets Coy «+» » Cg are the only elements of weight 1 in B(x). 
Consider the product of e, € B(3) and this matrix H: 


101 
eH = (01 0)|O0 1 1|=(0 1 1)=row2ofH. 
LWA 


Exercise 10 shows that the same thing happens in the general case. If 2;€ B(n) and H 
is a matrix with 7 rows, then 


e;H is the ith row of the matrix H. 
Now assume that C is a linear code with parity-check matrix H and that the rows 
of H are nonzero and no two of them are the same. Then e,H = ith row of H + 0 by 
hypothesis; hence, by Theorem 16.9, 


e; is not a codeword. 


Furthermore, if i #/, then e, and e, cannot be in the same coset of C (otherwise row i 
of H = e,H = eH = row j of H by Corollary 16.10). Thus 


e; is the only element of weight 1 in its coset. 
So every other element in the coset of e; has weight at least 2.* Consequently, 
e; is always the coset leader in its coset. 


Finally, if the syndrome of a received word w is the ith row of H, then wH = e,H, so 
w and gare in the same coset by Corollary 16.10. 


*The only element of weight 0 is 000 - - -0, whose coset is C. C is not the coset of e; because e, is 
not a codeword. 
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The preceding paragraph suggests a convenient way to implement (possibly incom- 
plete) syndrome decoding when the rows of H are nonzero and distinct: 
1. If wis received, compute its syndrome wH. 
2. If wH = 0, decode w as w (because w is a codeword by Theorem 16.9). 
3. If wH #0 and wH is the ith row of H, decode w by changing its ith coordinate 
(that is, decode w as w — e, because e, is w’s coset leader). 
4. If wH#0 and wHis not arow of H, do not decode and request a retransmission. 


This scheme (called parity-check matrix decoding) can be easily implemented with large 
codes because there is no need to compute cosets or find coset leaders. Furthermore, 


Theorem 16.11 


Let C be a linear code with parity-check matrix H. If every row of H is 
nonzero and no two are the same, then parity-check matrix decoding 
corrects ali single errors. 


Proof > When a codeword u is transmitted with exactly one error in coordinate 
i and received as w, then w — u = e, By Theorem 16.9, wH = (e+ u)H = 


eH + uH = eH + 0 = eH, which is the ith row of H. Therefore, w is 
correctly decodedasw—e=u. Wm 


EXAMPLE 5 


Let C be the (5, 2) code whose parity-check matrix H is give in Example 2. If 
10011 is received, its syndrome is 


(10011)H=(1001 1) 


oer = m 
O =- O O = 
=. OOK © 


=(1 0 1)=row2of H. 


Therefore, 10011 is decoded as 10011 — e, = 10011 — 01000 = 11011. If 11000 
is received, verify that its syndrome is 011, which is not a row of H. Therefore, 
11000 is not decoded, and a retransmission is requested. 


In one important class of codes, parity-check matrix decoding is actually complete 
syndrome (nearest-neighbor) decoding. 
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EXAMPLE 6 


The standard generator matrix G for the Hamming (7, 4) code was given in 
Example 6 of Section 16.1. Its parity-check matrix H has distinct, nonzero 
TOWS: 


X 

Il 
Domene 
or orr or 
-00.0mm 

7 


The possible syndromes of a received word w in this code are 000 and the seven 
nonzero elements of B(3). But all the nonzero elements of B(3) appear as rows 
of H. So every syndrome either is 000 (decode w as itself) or is the ith row of H 
for some i (decode w by changing its ith coordinate). Therefore, every received 
word is decoded. 


Example 6 is one of an infinite class of codes that can be described by using the 
fact that a linear code is completely determined by its parity-check matrix (from which 
a standard generator matrix is easily found). Let r = 2 be an integer and let n = 2” — 1 
and k = 7 — 1 —r. Then n — k = r. The preceding example is the case r = 3. Let H 
be then X (n — k) matrix whose last r rows are the identity matrix J, and whose n rows 
consist of al! the nonzero elements of B(r). Since the number of nonzero elements in 
B(r) is 2’ — 1 = n, each nonzero element appears exactly once as a row of H. So the 
rows of H are distinct and nonzero. The code with this parity-check matrix is called a 
Hamming code. 

In every Hamming code, all possible syndromes are rows of H. So parity-check 
matrix decoding is complete syndrome.decoding that corrects all single errors. 


E Exercises 


A. 1. Find the parity-check matrix of each standard generator matrix in Exercise 5 
of Section 16.1. 


2. Find the parity-check matrix for the code in Example 7 of Section 16.1. 


3. Find the parity-check matrix for the parity-check code in Example 2 of 
Section 16.1. [See Exercise 7 in Section 16.1.] 


4. Find the parity-check matrix for the (10, 2) repetition code in Example 5 of 
Section 16.1. [See Exercise 8 in Section 16. 1.] 


5. Find a parity-check matrix for the (15, 11) Hamming code. 
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6. Show that the linear code C with parity-check matrix cannot correct 


or oO = Ọ = 
=- O = OỌ = © 


every single error. 


1011 
7. Let C be the (4, 2) code with standard generator matrix G = ( ohare a 


Construct a standard array for C and find the syndrome of each coset leader. 

8. Construct a standard array for the (6, 3) code in Example in 7 of Section 16.1 
and find the syndrome of each coset leader. 

9. Choose new coset leaders (when possible) for the (5, 2) code in Example 1 and 
use them to construct a standard array. How does this array compare with the 
one in Example 1? 

10. Let e; = 00 - - - 010 - - +00 € B(x) have 1 in coordinate i and 0 elsewhere. If H is 
a matrix with n rows, show that e,H is the ith row of H. 
B. 11. Suppose a codeword u is transmitted and w is received. Show that standard- 
array decoding will decode w as u if and only if w — u is a coset leader. 
12. If every element of weight = ¢ is a coset leader in a standard array for a code 
C, show that C corrects t errors. 
13. If a codeword v is transmitted and w is received, then e = w — wis called 
an error pattern. Prove that an error will be detected if and only if the 
corresponding error pattern is not a codeword. 
14. Prove that the function f:B(n) — B(n — k) in the proof of Theorem 16.9 is 
surjective. [Hint: If v = viv, +> * v,_, E B(n — k), show that v = f(u), where 
u = 000 + + - Ove + + + va- E B(n).] 
15. Let C bea linear code with parity-check matrix H. Prove that C corrects single 
errors if and only if the rows of H are distinct and nonzero. 
16. Show by example that parity-check matrix decoding with the Hamming (7, 4) 
code cannot detect two or more errors. 
17. Show that in any Hamming code, every nonzero codeword has weight at least 3. 
18. [Probability required.] In the (5, 2) code in Example 1, suppose that the 
probability of a transmission error in a single digit is .01. 
(a) Show that the probability of a single codeword being transmitted without 
error is .95099. 
(b) Show that the probability of a 100-word message being transmitted 
without error is less than .01. 
(c) Show that the probability of a single codeword being transmitted with 
exactly one error is .04803. 
(d) Show that the probability that a single codeword is correctly decoded by 
the standard array in Example | is at least .99921. 
(e) Show that the probability of a 100-word message being correctly decoded 
by the standard array is at least .92. [Hint: Compare with part (b).] 
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16.3 BCH Codes 


The Hamming codes in the last section have efficient decoding algorithms that correct 
all single errors. The same is true of the BCH codes* presented here. But these codes 
are even more useful because they correct multiple errors. 

The construction of a BCH code uses a finite ring whose additive group is (isomor- 
phic to) some B(n). Each ideal in such a ring is a linear code because its additive group 
is (isomorphic to) a subgroup of B(z). The additional algebraic structure of the ring 
provides efficient error-correcting decoding algorithms for the code. 

The finite rings in question are constructed as follows, Let n be a positive integer 
and (x" — 1) the principal ideal in Z,[x] consisting of all multiples of x* — 1. 
The elements of the quotient ring Z,[x]/(x" — 1) are the congruence classes (cosets) 
modulo x" — 1. By Corollary 5.5, the distinct congruence classes in Z, [x]/(x* — 1) are 
in One-to-one correspondence with the polynomials of the form 


(+) m taxt ap? t. H aa a, with a,E Z2 
Each such polynomial has n coefficients, and there are two possibilities for each coef- 
ficient. Hence, Z,[x]/(x* — 1) is a ring with 2" elements. Furthermore, the n coefficients 


(Gp, G1, âz, - . . , &_1) Of the polynomial (+) may be considered as an element of the 
group B(n) = Za X +--+ XZ. 


Theorem 16.12 


The function f:2, [x]/(x"” — 1) > B(n) given by 

Klao + a,x + aX? + + + > apy XT) = (Bo) By) os © > Ant) 
is an isomorphism of additive groups. 
Proof > Exercise 7. m 


Theorem 16.12 shows that every ideal of Z,[x]/(x" — 1) can be considered as 
a linear code since it is (up to isomorphism) a subgroup of B(x). In particular, if 
a(x) €#,[x], then the congruence class (coset) of g(x) generates a principal ideal J in 
ZAx/0e — 1). The ideal J consists of all congruence classes of the form [A(x)g(x)] with 
h(x) €Z,[x]. BCH codes are of this type. 

In order to define a BCH code that corrects ¢ errors, choose a positive integer r such 
that t < 27). Let n = 2 — 1. Then g(x) is determined by considering a finite field of 
order 2’, as explained below. 


EXAMPLE 1 


We let t = 2 and r = 4, so that n = 2% — 1 = 15. We shall construct a code in 
Z24x](x'5 — 1) that corrects all double errors by finding an appropriate g(x). To 
do this, we need a field of order * = 16. 


*The initials BCH stand for Bose, Chaudhuri, and Hocquenghem, who invented these codes in 
1959-1960. 
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The polynomial 1 + x + x“ is irreducible in Z.[x] (Exercise 3). Hence, 
K = ZJx]/(1 + x + x‘) isa field of order 16 by Theorem 5.10 (and the remarks 
afterit). By Theorem 5.11, K contains a root a of 1 + x + x*. Using the fact that 


l+a+t+at=0  and,hence, at=1+a* 


we can compute the powers of œ. For example, aô = œa! = a? (1 + a) =a? + œ. 
Similarly, we obtain 


a= &=e +e e=ataet+ad 


a 
a2 =e d =l+a+a a?=1+a+e + 
a= a&=1+a? a®=1+a+a3 
at=l+a e=atae a*=1+e 

=a +a al =1+a+a a=] 


These elements are distinct and nonzero by statements (1) and (2) of Theorem 11.7 
(with u = a and p(x) = 1 + x + x‘). Therefore, they are all the nonzero ele- 
ments of K, anda is a generator of the multiplicative group of K. 

To construct the polynomial g(x), we first find the minimum polynomials of 
a, a’, a, a* over Zp. By the construction of K, the minimal polynomial of a is 
m(x) = 1+ x + x. This polynomial m,(x) is also the minimal polynomial of 
a’ and af, for instance, by the Freshman’s Dream (Lemma 11.24), 


m(a?) = 1 + (a2) + (ey 
= 1+ (a) + (a? = (1 +a +a = P =0. 


Verify that the minimum polynomial of a? is m(x) = 1 +x +x? + £ + x 
(Exercise 5). The polynomial g(x) is defined as the product m,(x)m3(x), so that 


g(x) =l txt A+ x4+ 274+ 24 x) 
=] + xt + x$ + x+ PeEZ Ix]. 


Let C be the ideal generated by [g(x)] in Za[x] (xf — 1). Then Cis a code by 
Theorem 16.12. We shall see that C is a (15, 7) code that corrects all single and 
double errors. 

Just what do the codewords of C look like? By Corollary 5.5, each congruence class 
in Z,Jx]/(x'5 — 1) is the class of a unique polynomial of the form 


(++) a + ax + ax + +++ + agx? + aaa, with a; E Z}. 


So we shall denote the class by this polynomial.t When convenient, this poly- 
nomial will be identified (as in Theorem 16.12) with the element ay a; a, ++ > a4 = 
(ão, Qj, A2, -. - , 414) Of B(15). The codewords consist of the classes of polyno- 
mial multiples of g(x). For example, 


“Remember, 1 = —1in Z} 
tThis is analogous to what was done in Section 2.3, when we began writing elements (classes) in 
2, in the form & rather than [A]. 
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Codeword in Polynomial Form In K(15) Form 
ax) =1t+ 44354 x7 + x8 100010111000000 
reg(x) = x1 + x4 + x6 + x7 + x8) 
Hxt++x7+ 24+ >9 010001011100000 


(1 + x)g(x) = (1 + x) + 4 + x6 + x? + x4) 
HS Lt A Hx? + 8 + H xdg YI 4 14 100010011010111 


If g(x) is multiplied by a polynomial A(x) of degree = 7, then the codeword 
h(x)g(x) has degree = 15 and is not of the form (+*+). For example, if A(x) = xf, 
then 


A(x)g(x) = x*g(x) = x8 + xt + x6 + x7 + x4) 
= x8 $ xX? + xh + x + x, 
The polynomial of the form (++) that is in the same class as A(x)g(x) is the re- 
mainder when A(x)g(x) is divided by x'5 — 1 (see Corollary 5.5). Verify that 


ADR) = (1 + xs — 1) + (1 + xt xt + 8? + 2%), 


Hence, [f(x)g(x)] is the codeword 1 + x + xê + x? + x" or, equivalently, 
110000001000101. 


The procedure in Example 1 is readily generalized. If t is the number of errors the 
code should correct, let n = 2” — 1, where r is chosen so that f < 2°"! (in the example, 
t = 2, r = 4). By Corollary 11.26, there is a finite field K of order 2’. By Theorem 11.28, 
K = Za), where a is a generator of the multiplicative group of nonzero elements of 
K (and so has multiplicative order 2” — 1 = n). Let 


m(x), (x), m(x), . « . , Mx) €Z2[x] 
be the minimal polynomials of the elements 
a, œ, @,... p, 0 EK. 


Let g(x) be the product in Z,[x] of the distinct polynomials on the list m(x), 
MAX), « « - , ma). 

The ideal C generated by [g(x)] in Z,[x]/(x" — 1) is called the (primitive narrow- 
sense) BCH code of length z and designed distance 2¢ + 1 with generator polynomial 
g(x). So the code in Example 1 is a BCH code of length 15 and designed distance 
5 (= 2-2 + 1). If g(x) has degree m, then Exercise 14 shows that the code C is an 
(n, k) code, where k = n — m 


Theorem 16.13 


A BCH code of length n and designed distance 2t + 1 corrects t errors. 
Proof > The proof requires a knowledge of determinants; see Lidl-Pilz [32; 
page 230]. @ 
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Theorem 16.13 shows that there are BCH codes that will correct any desired num- 
ber of errors. More importantly, from a practical viewpoint, there are efficient algo- 
rithms for decoding large BCH codes.* A complete description of them would take 
us too far afield. But here, in simplified form, is the underlying idea of the error- 
correcting procedure. 

Let C be a BCH code of designed distance 2t + 1 and generator polynomial g(x). 
By the definition of g(x), each minimal polynomial m,(x) divides g(x). Hence, g(a’) = 0 
for each i= 1, 2,..., 2t. If [/(x)] is a codeword in C, then f(x) = A(x)g(x) for some 
A(x), and, therefore, 


S (a!) = hla") g(a’) = h(a!) -0 = 0. 


Conversely, if f(x)e Z[x] has every a! as a root, then every m,(x) divides f(x) by 
Theorem 11.6. This implies that g(x) | f(x) (Exercise 8). Therefore, 


| f(x)| is a codeword if and only if f(a’) = 0 for 1 <i 2r 


The decoder receives the word aga, + + - a, which represents the (class of) the 
polynomial 


Ax) = ag + ax + ax? +--+ + ape. 


The decoder computes these elements of the field K = 7,(a): 


(a), (a), (œ), ..., (a). 


If all of them are 0, then r(x) is a codeword by the remarks above. If certain ones are 
nonzero, the decoder uses them (according to a specified procedure) to construct a 
polynomial D(x) € K[x], called the error-locator polynomial. Since K is finite, the non- 
zero roots of D(x) in K can be found by substituting each a’ € K in D(x)]. 

If no more than t errors have been made, the nonzero roots of D(x) give the 
location of the transmission errors. For instance, if a? is a root, then a, is incor- 
rect in the received word r(x); similarly if a? = 1 is a root, then an error occured in 
transmitting ap. 

If D(x) has no roots in K or if certain of the r(a') are 0, so that D(x) cannot be 
constructed, then more than ¢ errors have been made. So the decoder follows set pro- 
cedures (omitted here) to choose arbitrarily a nearest codeword to r(x). 


EXAMPLE 2 


In the (15, 7) BCH code of Example 1, suppose this word is received: 


r(x) = x + x’ + x = 010000011000000. 


*This is one reason BCH codes are widely used. For example, the European and trans-Atlantic 
communication system used a BCH code with £ = 6 and r = 8. It is a (255, 231) code that corrects six 
errors with a failure probability of only 1 in 16 million. 
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Using the table at the beginning of Example | and the fact that u + u = O for 
every element u in K (Exercise 1), we have 
Hab=atadta=at(ltatat+(lt+ta=e+a =a 
r(a*) = œ + (œ) + (a2)? 
=@+a%+a%=a'+a5+ a? 
=O 4+ (P+ A+ (a+) at H =a", 
Exercise 6 shows that 
ra?) = ra)? = (a)? = a, 
Hat) = a) = ($ = a” = a. 
The error-locator polynomial is given by this formula (which is justified in 
Exercise 15): 


D(x) = x7 + r(a)x + (x + 3) 
rla) 


Using the table at the beginning of Example 1, we see that 


D(x) = x7 + a&x + (a2 + =) =x + ax + (a! + a) 


= £ +a% +a" 
By substituting each of the nonzero elements of K in D(x), we discover that 
D(a’) = (fF + afa? + a" = a! + al! + a 
=(1+a+a + (a+ +a’) +(1+ a) =0; 
D(a’) = (e°)? + af + a =a" +a +a =a +1+ 0! 
=œ +1 +(1 +a) =0. 
Therefore, a and a” are the roots of D(x), so errors occurred in the coefficients 
of x and x°. The received word 
r(x) =x + x? + x? = 01000001 1000000 
is corrected as 
(x) = x + xf + x’ + x8 + x = 010001011100000, 


which is a codeword (see page 494). 
Similarly, if r(x) = x? + xf + x? + x! = 0010001001 10000 is received, then 


rna) =a% Sa, ra*)=a°, and 
D(x) = x7 + r(a)x + [re + to] =æ + a'x + (c + =) 
= Ê + ax + (a +a) =x + ax = x(x + a’). 


The only nonzero root of D(x) is aê, so a single error occurred in the coefficient 
of x$, and the correct word is 


e(x) = x7 + x6 + xê +x + x! = 001000101110000. 
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Finally, if 1 + x + x“ is received, then 
nra)=lt+at+at= and Aæ)=l1 +æ +a? = ai. 
So D(x) cannot be constructed, and we conclude that more than two errors have 


occurred. Similarly, if 1 + x + X is received, then verify that D(x) = x* + a’x + a° 
and that D(x) has no roots in K. Once again, more than two errors have occurred. 


E Exercises 


NOTE: Unless stated otherwise, K is the field Z,[x|/(1 + x + x‘) of order 16 anda is a 
root of 1+ x +x‘, as in Example 1. 


A. 1. (a) Prove that f(x) + f(x) = 0 for every f(x) € Z,[x]. 
(b) Prove that u + u = 0 for every u in the field K. 


2. Show that the only irreducible quadratic in Z,[x] is x* + x + 1. 
[Hint: List all the quadratics and use Corollary 4.19.] 


3. Provethat 1 + x + X is irreducible in Z,[x]. [Hint: Exercise 2and Theorem 4.16.] 


4. Prove that the minimal polynomial of @% over Z, is 1 + x + x. 
[Hint: Use the table in Example 1.] 


5. (a) Prove that the minimal polynomial of a? over Z, is 1 + x + x2 + 2° + x. 
[Hint: Exercise 2, Theorem 4.16, and the table in Example 1.] 


(b) Show that a’ is also a root of 1 + x + x4. 
B. 6. If f(x) €Z,[x] and e is an element in some extension field of Z}, prove that for 
every k = 1, f(a) = f(a}. [Hint: Lemma 11.24,] 
7. (a) Show that the function f: Z.[x]/(x" — 1) > Bn) given by 
fle + ax + ay? + ++ + aay XM) = (aos di an >- © s Ay—1) 
is surjective. 
(b) Prove that f is a homomorphism of additive groups. 
(c) Prove that fis injective. [Hint: Theorem 8.17 in additive notation.] 
8. (a) Let F be a field and f(x) E F[x]. If p(x) and g(x) are distinct monic 
irreducibles in F[x] such that p(x) | f(x) and q(x) | f(x), prove that 


POJK) | f(x). [Hint: If f(x) = q(Qx)A(x), then p(x) | (x)A(x); use part (2) 
of Theorem 4.121] 

(b) If m(x), m(x), . . . , 772,(2) are distinct monic irreducibles in F [x] such that 
each máx) divides f (x), prove that g(x) = 72(x)m2(x) + + + m(x) divides f (x). 


9. Let C be the (15, 7) BCH code of Examples | and 2. Use the error-correction 
technique presented there to correct these received words or to determine that 
three or more errors have been made. 


(a) 1 + x = 110000000000000. 
(b) 1 +x? + x* + xê = 100111000000000. 
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(c) 1+ x? + 4 +x? = 101010010000000. 
(d) 1+ x£ + x? + x + xX? = 100000111 100000. 


Show that the generator polynomial for the BCH code with t = 3, r = 4, 
n= 15isg(x)=1+xt+ +x +x? +d + x", (Hint: Exercises 3-5 may 
be helpful] 


. Let K = Z,(a) be a finite field of order 2’, whose multiplicative group is 


generated by a. For each i, let (x) be the minimal polynomial of a over Z,. 
If n = 2’ — 1, prove that each m,(x) divides x” — 1. [Hint: a” = 1 (Why?); use 
Theorem 11.6.] 


. If g(x) is the generator polynomial of a BCH code in Z.[x]/(x" — 1), prove that 


g(x) divides x" — 1. [Hint: Exercises 11 and 8(b).] 


. Let g(x) €Z,[x] be a divisor of x" — 1 and let C be the principal ideal generated 


by [g(x)] in Z,[x]/(2* — 1). Then C is a code. Prove that C is cyclic, meaning 

that C (with codewords written as elements of B(n)) has this property: If 

(Cos C1,» + + s Cn-1) E C, then (Cy_ys Cos Crs» «+ s Sp—2) E C. [Hint: Cp-1 + eye Hee + 

tap =AMCot exter + Cne’) =i m=] 

Let C be the code in Exercise 13. Assume g(x) has degree m and let k = n — 

Let J be the set of all polynomials in Z,[x] of the form a + a,x + ax? + - 2 

appt. 

(a) Prove that every element in C is of the form [s(x)g(x)] with s(x) € J. [Hint: 
Let [A(x) g(x) E C. By the Division Algorithm, A(x)g(x) = (x)(x — 1) + 
r(x), with deg r(x) < n and [A(x)g(x)] = [r(x)]. Show that r(x) = s(x)g(x), 
where s(x) = h(x) — e(x)f(x) and 9(x)f(x) = x" — 1. Use Theorem 4.2 to 
show s(x)E J] 

(b) Prove that C has order 2*, and, hence, Cis an (n, k) code. [Hint: Use 
Corollary 5.5 to show that if s(x) # t(x) in J, then [s(x)g(x)] # [1(x)g(x)] in 
C. How many elements are in J7] 


. Let C be the (15, 7) BCH code of Examples | and 2, with codewords written 


as polynomials of degree =14. Suppose the codeword c(x) is transmitted 

with errors in the coefficients of xX and x and r(x) is received. Then D(x) = 

(x + a(x + of) € K[x], whose roots are of and œ, is the error-locator polynomial. 
Express the coefficients of D(x) in terms of (a), r(a?), r(a3) as follows. 


(a) Show that r(x) — c(x) = x + xX. 


(b) Show that r(a) = a"! + af for k = 1, 2, 3. [See the boldface statement on 
page 495,] 


(©) Show that D(x) = 7 + (a! + ax + al =x + r(a)x + al, 


(d) Show that a’*/ = #(a*) + a [Hint: Show that r(a)’ = (a + aY = 


a” + a” + ala! + a!) = r(a3) + qaa! and solve for a“; note that 


Hay? = r(?),] 
Show that a BCH code with ¢ = 1 is actually a Hamming code (see page 490). 
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APPENDIX A 


Logic and Proof 


This Appendix summarizes the basic facts about logic and proof that are needed to 
read this book. For a complete discussion of these topics see Galovich [7], Smith- 
Eggen-St. Andre [10], or Solow [11]. 


Logic 
A statement is a declarative sentence that is either true or false. For instance, each of 
these sentences is a statement: 

a is a real number. 

Every triangle is isosceles. 

103 bald eagles were born in the United States last year. 
Note that the last sentence is a statement even though we may not be able to verify its 
truth or falsity. Neither of the following sentences is a statement: 

What time is it? Wow! 


Compound Statements 


We frequently deal with compound statements that are formed from other statements 
by using the connectives “and” and “or”. The truth of the compound statement will 
depend on the truth of its components. If P and Q are statements, then 


“P and Q” is a true statement when both 
P and Q are true, and false otherwise. 


For example, 
T is a real number and 9 < 10 
is a true statement because both of its components are true. But 
a is areal number and 7 — 5 = 18 


is a false statement since one of its components is false. 
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In ordinary English the word “or” is most often used in exclusive sense, meaning 
“one or the other but not both,” as in 


He is at least 21 years old or he is younger than 21. 


But “or” can also be used in an inclusive sense, meaning “one or the other, or possibly 
both,” as in the sentence 


They will win the first game or they will win the second. 


Thus the inclusive “or” has the same meaning as “and/or” in everyday language In math- 
ematics, “or” is always used in the inclusive sense, which allows the possibility that both com- 
ponents might be true but does not require it. Consequently, if P and Q are statements, then 


“P or Q” is a true statement when at least one of P or Q 
is true and false when both P and Q are false. 


For example, both 
1>5 or 3+8=11 

and 
1>5 or 3+8=23 

are true statements because at least one component is true in each case, but 
4<2 or 5+3=12 


is false since both components are false. 


Negation 


The negation of a statement P is the statement “it is not the case that P”, which we can 
conveniently abbreviate as “not-P”. Thus the negation of 


7 is a positive integer 
is the statement “it is not the case that 7 is a positive integer”, which we would normally 
write in the less awkward form “7 is not a positive integer”. If P is a statement, then 


The negation of P is true exactly when P is false, and 
the negation of P is false exactly when P is true. 


The negation of the statement “P and Q” is the statement “it is not the case that P and 
Q”. Now “P and Q” is true exactly when both P and Q are true, so to say that this is 
not the case means that at least one of P or Q is false. But this occurs exactly when at 
least one of not-P or not-Q is true. Thus 


The negation of the statement “P and Q” is the statement 
“not-P or not-Q”. 


For example, the negation of 
fis continuous and f is differentiable at x = 5 
is the statement 
fis not continuous or fis not differentiable at x = 5. 


The negation of the statement “P or Q” is the statement “it is not the case that P 
or Q”. Now “Por Q” is true exactly when at least one of P or Q is true. To say that this 
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is not the case means that both P and Q are false. But P and Q are both false exactly 
when not-P and not-Q are both true. Hence, 


The negation of the statement “P or Q” is the statement 
“not-P and not-Q”. 


For instance, the negation of 
119 is prime or V3 is a rational number 
is the statement 


119 is not prime and V3 is not a rational number. 


Quantifiers 


Many mathematical statements involve quantifiers. The universal quantifier states that 
a property is true for all the items under discussion. There are several grammatical 
variations of the universal quantifier, such as 


For all real numbers c, ¢ > —1. 

Every integer isa real number. 

All integers are rational numbers. 

For each real number a, the number a” + 1 is positive. 

The existential quantifier asserts that there exists at least one object with certain 

properties. For example, 

There exist positive rational numbers. 

There exists a number x such that x? — Sx + 6 = 0. 

There is an even prime number. 
In mathematics, the word “some” means “at least one” and is, in effect, an existential 


quantifier. For instance, 
Some integers are prime 


is equivalent to saying “at least one integer is prime”, that is, 
There exists a prime integer. 


Care must be used when forming the negation of statements involving quantifiers. 
For example, the negation of 


All real numbers are rational 


is “it is not the case that all real numbers are rational”, which means that there is at 
least one real number that is irrational (= not rational). So the negation is 


There exists an irrational real number. 


In particular, the statements “all real numbers are not rational” and “all real num- 
bers are irrational” are not negations of “all real numbers are rational”. This example 
illustrates the general principle: 


The negation of a statement with a universal quantifier 
is a statement with an existential quantifier. 
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The negation of the statement 
There exists a positive integer 


is “it is not the case that there is a positive integer”, which means that “every integer is 
nonpositive” or, equivalently, “no integer is positive”. Thus 


The negation of a statement with an existential quantifier 
is a statement with a universal quantifier. 


Conditional and Biconditional Statements 


In mathematical proofs we deal primarily with conditional statements of the form 
If P, then Q 


which is written symbolically as P = Q. The statement P is called the hypothesis or 
premise, and Q is called the conclusion. Here are some examples: 


If cand dare integers, then cd is an integer. 
If f is continuous at x = 3, then fis differentiable there. 
a#0>a'>0. 


There are several grammatical variations, all of which mean the same thing as “if P, 
then Q”: 


Pimplies Q. 

P is sufficient for Q. 
Q provided that P. 
Q whenever P. 


In ordinary usage the statement “if P, then Q” means that the truth of P guarantees 
the truth of Q. Consequently, 


“P = Q” is a true statement when both P and Q are 
true and false when P is true and Q is false. 


Although the situation rarely occurs, we must sometimes deal with the statement 
“P => Q” when P is false. For example, consider this campaign promise: “If I am elected, 
then taxes will be reduced”. If the candidate is elected (P is true), the truth or falsity of 
this statement depends on whether or not taxes are reduced. But what if the candidate 
is not elected (P is false)? Regardless of what happens to taxes, you can’t fairly call 
the campaign promise a lie. Consequently, it is customary in symbolic logic to adopt 
this rule: 


When P is false, the statement “P = Q” is true. 


The contrapositive of the conditional statement “P = Q” is the statement “not-Q 
=> not-P”. For instance, the contrapositive of this statement about integers 


If cis a multiple of 6, then cis even 
is the statement 


If cis not even, then c is not a multiple of 6. 
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Notice that both the original statement and its contrapositive are true. Two statements 
are said to be equivalent if one is true exactly when the other is. We claim that 


The conditional statement “P = Q” is equivalent 
to its contrapositive “not-Q = not-P”. 


To prove this equivalence, suppose P = Q is true and consider the statement not-Q = 
not-P. Suppose not-Q is true. Then Q is false. Now if P were true, then Q would neces- 
sarily be true, which is not the case. So P must be false, and, hence, not-P is true. Thus 
not-Q = not-P is true. A similar argument shows that when not-Q = not-P is true, then 
P= Qis also true. 

The converse of the conditional statement “P = Q” is the statement “Q = P”. For 
example, the converse of the statement 


If b is a positive real number, then 2” is positive 
is the statement 
If B’ is positive, then b is a positive real number. 


This last statement is false since, for example, (~3)* is the positive number 9, but —3 
is not positive. Thus 


The converse of a true statement may be false. 


There are some situations in which a conditional statement and its converse are 
both true. For example, 


If the integer k is odd, then the integer k + 1 is even 
is true, as is its converse 
If the integer k + 1 is even, then the integer k is odd. 


We can state this fact in succinct form by saying that “k is odd if and only if k + 1 is 
even”. More generally, the statement 


P if and only if Q, 
which is abbreviated as “P iff Q” or “P < Q”, means 
P=+Q and Q=>P. 


“P if and only if Q” is called a biconditional statement. The rules for compound state- 
ments show that “P if and only if Q” is true exactly when both P = Q and Q = Pare 
true. In this case, the truth of P implies the truth of Q and vice versa, so that P is true 
exactly when Q is true. In other words, “P if and only if Q” means that P and Q are 
equivalent statements. 


Theorems and Proof 


The formal development of a mathematical topic begins with certain undefined terms 
and axioms (statements about the undefined terms that are assumed to be true). These 
undefined terms and axioms are used to define new terms and to construct theorems 
(true statements about these objects). The proof of a theorem is a complete justifica- 
tion of the truth of the statement. 
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Most theorems are conditional statements. A theorem that is not stated in condi- 
tional form is often equivalent to a conditional statement. For instance, the statement 


Every integer greater than 1 is a product of primes 


is equivalent to 
If n is an integer and n > 1, then n is a product of primes. 


The first step in proving a theorem that can be phrased in conditional form is 
to identify the hypothesis P and the conclusion Q. In order to prove the theorem 
“P => Q”, one assumes that the hypothesis P is true and then uses it, together with 
axioms, definitions, and previously proved theorems, to argue that the conclusion Q is 
necessarily true. 


Methods of Proof 


Some common proof techniques are described below. While such summaries are help- 
ful, there are no hard and fast rules that give a precise procedure for proving every 
possible mathematical statement. The methods of proof to be discussed here are in 
the nature of maps to guide you in analyzing and constructing proofs. A map may not 
reveal all the difficulties of the terrain, but it usually makes the route clearer and the 
journey easier. 


DIRECT METHOD This method of proof depends on the basic rule of logic 
called modus ponens: If R is a true statement and “R = S” is a true conditional 
statement, then S is a true statement. To prove the theorem “P = Q” by the direct 
method, you find a series of statements P, P} . - . , P, and then verify that each of 
the implications P = P}, P, => Pz, P = Ps... , Pa; => Py, and P, = Q is true. Then 
the assumption that P is true and repeated use of modus ponens show that Q is true. 

The direct method is the most widely used method of proof. In actual practice, it 
may be quite difficult to figure out the various intermediate statements that allow you 
to proceed from P to Q. In order to find them, most mathematicians use a thought 
process that is sometimes called the forward-back ward technique. You begin by work- 
ing forward and asking yourself, What do I know about the hypothesis P? What facts 
does it imply? What statements follow from these facts? And so on. At this point you 
may have a list of statements implied by P whose connection with the conclusion Q, 
if any, is not yet clear. 

Now work backward from Q by asking, What facts would guarantee that Q is 
true? What statements would imply these facts? And so on. You now have a list of 
statements that imply Q. Compare it with the first list. If you are fortunate some state- 
ment will be on both lists, or more likely, there will be a statement S on the first list and 
a statement Ton the second, and you may be able to show that S = T. Then you have 
P > S and S > T and T= Q, so that P > Q. 

When you have used the forward-backward technique successfully to find a proof 
that P = Q, you should write the proof in finished form. This finished form may look 
quite different from the thought processes that led you to the proof. Your thought 
process jumped forward and backward, but the finished proof normally should begin 
with P and proceed in step-by-step logical order from P to S to T to Q. The fin- 
ished proof should contain only those facts that are needed in the proof. Many state- 
ments that arise in the forward- backward process turn out to be irrelevant to the final 
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argument, and they should not be included in the finished proof. As illustrated in most 
of the proofs in this book, the finished proof is usually written as a narrative rather 
than a series of conditional statements. 


CONTRAPOSITIVE METHOD Since every conditional statement is equiv- 
alent to its contrapositive, you may prove “not-Q => not-P” in order to conclude that 
“P = Q” is true. For example, instead of proving that for a certain function f, 


If a + b, then f(a) # f(b) 
you can prove the contrapositive 


If f(a) = Kb), then a = b. 


PROOF BY CONTRADICTION Suppose that you assume the truth of a 
statement R and that you make a valid argument that R = S (that is, R = S is a true 
statement). If the statement S is in fact a false statement, there is only one possible 
conclusion: The original statement R must have been false, because a true premise R 
and a true statement R = S lead to the truth of S by modus ponens. 

In order to use this fact to prove the theorem “P= Q”, assume as usual that P is a 
true statement. Then apply the argument in the preceding paragraph with R = not-Q. 
In other words, assume that not-Q is true and find an argument (presumably using P 
and previously proved results) that shows not-Q = S, where S is a statement known to 
be false. Conclude that not-Q must be false. But not- is false exactly when Q is true. 
Therefore, Q is true, and we have proved that P = Q. Once again, the hard part will 
usually be finding the statement S and proving that not-Q implies S. 


EXAMPLE 1 


Recall that an integer is even if it is a multiple of 2 and that an integer that is 
not even is said to be odd. We shall use proof by contradiction to prove this 
statement 


If n? is even, then m is even. 


Here Pis the statement “m? is even” and Q is the statement “m is even”. We 


assume “m is not even” or equivalently “m is odd” (statement not-Q). But every 
odd integer is 1 more than some even integer. Since every even integer is a mul- 
tiple of 2, we must have m = 2k + 1 for some integer k. Then the basic laws of 
arithmetic show that 


mt? = (2k + 1} = 4 + 4k +1 = Ue + 2k) + 1. 


This last statement says that m? is 1 more than a multiple of 2, that is, m? is odd. 
But we are given that m? is even (statement P), and, hence, “m? is both odd and 
even” (statement S). This statement is false since no integer is both odd and even. 
Therefore, our original assumption (not-Q) has led to a contradiction (the false 
statement S). Consequently, not-Q must be false, and, hence, the statement “rm is 
even” (statement Q) is true. 
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In Example 1 various statements were labeled by letters so that you could easily 
relate the example to the general discussion. This is not usually done in proofs by 
contradiction, and such proofs may not be given in as much detail as in this example. 

The choice of a method of proof is partly a matter of taste and partly a question of ef- 
ficiency. Although any of those listed above may be used, one method may lead to a much 
shorter or easier-to-follow proof than another, depending on the circumstances. In addi- 
tion there are methods of proof that can be applied only to certain types of statements. 


PROOF BY INDUCTION This method is discussed in detail in Appendix C. 


CONSTRUCTION METHOD This method is appropriate for theorems that 
include a statement of the type “There exists a such-and-such with property so-and- 
so”. For instance, 


There is an integer d such that d? — 4d — 5 =0. 


If r and s are distinct rational numbers, then there is a rational number between r 
and s. 


If r is a positive real number, then there is a positive integer m such that m <r. 


To prove such a statement, you must construct (find, build, guess, etc.) an object with 
the desired property. When you are reading the proof of such a statement, you need 
only verify that the object presented in the proof does in fact have the stated property. 
An existence proof may amount to nothing more than presenting an example (for 
instance, the integer 2 provides a proof of “there exists a positive integer”). But more 
often a nontrivial argument will be needed to produce the required object. 


Caution Although an example is sufficient to prove an existence state- 
ment, examples can never prove a statement that directly or indirectly 
involves a universal quantifier. For instance, even if you have a million 
examples for which this statement is true: 


If c is an integer, then c? — e + 11 is prime, 
you will not have proved it. For the statement says, in effect, that for every 
integer c, a certain other integer is prime. This is mot the case when c = 


12 since 12? — 12 + 11 = 143 = 13- 11. So the statement is false. This 
example demonstrates that 


A counterexample is sufficient to disprove a statement. 


The moral of the story is that when you are uncertain if a statement is true, try to 
find some examples where it holds or fails. If you find just one example where it fails, 
you have disproved the statement. If you can find only examples where the statement 
holds, you haven’t proved it, but you do have encouraging evidence that it may be true. 


Proofs of Multiconditional Statements 


In order to prove the biconditional statement “P if and only if Q”, you must prove 
both “P => Q” and “Q = P”. Proving one of these statements and failing to prove the 
other is a common student mistake. For example, the proof of 


A triangle with sides a, b, cis a right triangle with 
hypotenuse ¢ if and only if c? = a? + b? 
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consists of two separate parts. First you must assume that you have a right triangle 
with sides a, b and hypotenuse c and prove that ê = a’ + b?. Then you must give a 
second argument: Assume that the sides of a triangle satisfy c? = a? + b? and prove 
that this is a right triangle with hypotenuse c. 

A statement of the form 


The following conditions are equivalent: P, Q, R, S, T 


is called a multiconditional statement and means that any one of the statements P, Q, 

R, S, or T implies every other one. Thus a multiconditional statement is just shorthand 

for a list of biconditional statements; P <> Q and P <4 Rand P + Sand P + T and 

Q < Rand Q <% S, etc. To prove this multiconditional statement you need only prove 
P= Qand Q = Rand Rs Sand S= Tand T= P. 


All the other required implications then follow immediately; for instance, from T= P 
and P = Q, we know that T= Q, and similarly in the other cases. 


EXAMPLE 2 


In order to prove this theorem about integers: 

The following conditions on a positive integer p are equivalent: 

(1) pis prime 

(2) If pisa factor of ab, then p is a factor of aor p is a factor of b 
(3) Hp = rs thenr = tlors=+1. 


you must make three separate arguments. First, assume (1) and prove (2), so 
that (1) = (2) is true. Second, you assume (2) and prove (3), so that (2) => (3) 
is true. Finally, you must assume (3) and prove (1), so that (3) = (1) is true. Be 
careful: At each stage you assume only one of the three statements and use it 
to prove another; the third statement does not play a role in that part of the 
argument. 
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APPENDIX B 


Sets and Functions 


For our purposes, a set is any collection of objects; for example, 
The set Z of integers. 
The set of right triangles with area 24. 
The set of positive irrational numbers. 
The objects in a set are called elements or members of the set. If B is a set, the 


statement “b is an element of B” is abbreviated as “be B”. Similarly, “b ¢ B” means 
“b is not an element of B”. For example, if Z is the set of integers, then 


2E2 and a é@. 


There are several methods of describing sets. A set may be defined by verbal 
description as in the examples above. A small finite set can be described by listing all 
its elements. Such a list is customarily placed between curly brackets; for instance, 


{3, 7, —4, 9} or {a, b, c, r, $, t}. 


Listing notation is sometimes used for infinite sets as well. For example, {2, 4, 6, 8, . . .} 
indicates the set of positive even integers. Strictly speaking, this notation is ambiguous in 
the infinite case since it relies on everyone’s seeing the same pattern and understanding 
that it is to continue forever. But when the context is clear, no confusion will result. 
Finally, a set can be described in terms of properties that are satisfied by its elements, 
and by these elements only. This is usually done with set-builder notation. For example, 


{x | x is an integer and x > 9} 


denotes the set of all elements x such that x is an integer greater than 9. In general, the 
vertical line is shorthand for “such that” and “{y | P}” is read “the set of all elements 
y such that P”. Thus each of the following is the set of even integers: 


{x | x is an even integer}. 
{t| tEZ and tis even}. 
{r|r EZ and ris a multiple of 2}. 
{y |y EZ and y = 2k for some integer k}. 
509 
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The Empty Set 


Some special cases of set-builder notation lead to an unusual set. For instance, the set 
{x | x is an integer and 0 < x < 1} 


has no elements since there is no integer between 0 and 1. The set with no elements is 
called the empty set or null set and is denoted Ø. For every element ¢, 


cEMisfalse and cg ø is true. 


The empty set is a very convenient concept to have around, but some care must be 
taken when dealing with theorems that are true only for nonempty sets (that is, sets that 
have at least one element). 


Subsets 


A set Bis said to be a subset of a set C (written B & C) provided that every element of 
Bis also an element of C. In other words, B & C exactly when this statement is true: 


xEB=>xeEC. 


For example, the set of even integers is a subset of the set Z of all integers, and the set 
of rational numbers is a subset of the set of real numbers. 

The definition of “B & C” allows the possibility that B = C (since it is certainly 
true in this case that every element of B is also an element of C). In other words, 


B & B for every set B. 


If Bis a subset of Cand B # C we say that B is a proper subset of C and write B Ç G. 
The subset relation is easily seen to be transitive, that is, 


If B& Cand CCD, then BCD. 


Two sets B and C are equal when they have exactly the same elements. In this case 
every element of B is an element of C and every element of C i an element of B. Thus, 


B=C_ if and only if Bo Cand CCB. 


This fact is the most commonly used method of proving that two sets are equal: Prove 
that each is a subset of the other. 

Basic logic leads to a surprising fact about the empty set. Since the statement 
x €@ is always false, the implication 


xEO=>xEC 


is always true (see Appendix A). But this is precisely the definition of “Ø is a subset 
of C”. So 


the empty set Ø is a subset of every set. 
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Operations on Sets 


We now review the standard ways of constructing new sets from given ones. If B and 
C are sets, then the relative complement of C in B is denoted B — C and consists of the 
elements of B that are not in C. Thus 


B-C={x|xeBandx¢ C}. 


For example, if E is the set of even integers, then Z — E is the set of odd integers. 
The intersection of sets B and C consists of all the elements that are in beth B and 
Cand is denoted BN C. Thus 


BO C= {x|x € Band x € C}. 


For example, if B = {—2, 1, V2, 5, m} and C is the set of positive rational numbers, 
then BM C = {1,5} since 1 and 5 are the only elements in both sets. If B is the set of 
positive integers and C the set of negative integers, then B N C = Ø since there are no 
elements in both sets. When B and C are sets such that B N C = Ø, we say that B and 
C are disjoint. 

The union of sets B and C consists of all elements that are in at least one of B or 
Cand is denoted B U C. Thus, 


BU C= {x|x € Borx eC}. 


For example, the union of B = {1, 3, 5, 7} and C = {-1, 1, 4, 9} is BU C= 
{-1, 1, 3, 4, 5, 7, 9}. If B is the set of rational numbers and C is the set of irrational 
numbers, then B U C is the set of all real numbers. 

You should verify that union and intersection have the following properties. For 
any sets B, C, and D, 


BUB=B BOB=B 
BUØ=B BAD =Ø 
BUC=CUB BNC=CNB 
BOBUC BAN CSB 


BSC ifandonly if BUC=C 
B&C if and only if BNC=B 
BU(CUD)=(BUQUD BN(CND)=(BNQND 
BN(CUD)=(BN OQ U(BND) 
BU(CN D)=(BUQN(BUD). 


The concepts of union and intersection extend readily to large, possibly infinite, 
collections of sets. Suppose that J is some nonempty set (called an index set) and that 
for each i E J, we are given a set A,. Then the intersection of this family of sets (denoted 
N 4A; ) is the set of elements that are in all the sets 4, that is, 

-e 


pR A; = fx |x € A; for every i € I}. 


Similarly, the union of thia family of sets (denoted Y A) is the set of elements that 
are in at least one of the sets A,, that is, 


U A;= {x |x € A, for some j € I}. 
1E 
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The Cartesian product of sets Band Cisdenoted B X Cand consists of all ordered 
pairs (x, y) with x E B and yE C. Equality of ordered pairs is defined by this rule: 


(x, y) = (u, v) if and only if = uin Band y = vin C. 
For example, if B = {r, s, t} and C = {5,7}, then B X Cis the set 


{(r, 5); (C, 7), (s, 5), (s; 7), (4 5); (4 7}. 


The set R of real numbers is sometimes identified with the number line. When this is 
done, the Cartesian product R x R is just the ordinary coordinate plane, the set of all 
points with coordinates (x, y) where x, yER. 

The Cartesian product of any finite number of sets B,, Bo,..., B, is defined in a 
similar fashion. B; X B, X +++ X B, is the set of all ordered n-tuples (x1, X2,. . . s Xp) 
where xE B, for each i= 1, 2,..., n. For example, if B = {0, 1}, Z is the set of integers, 
and R the set of real numbers, then B X Z X R is the set of all ordered triples of the 
form (0, k, r) and (1, k, r) with kE Z and rE R. The product B X Z X R is an infinite 
set; among its elements are (0, —5, 3), (1, 24, ar), and (1, 1, — 4⁄3). 


Functions 


A function (or map or mapping) f from a set B to a set C (denoted f:B — C) is a rule 
that assigns to each element b of B exactly one element c of C; ¢ is called the image of 
b or the value of the function fat b and is usually denoted f(b). The set B is called the 
domain and the set C the range of the function f. 

Your previous mathematics courses dealt with a wide variety of functions. For 
instance, if R is the set of real numbers, then each of the following rules defines a function 
from R to R: 


f(x) = cos x, g(x) =x +1, h(x) = x° — 5x + 2. 


The rule of a function need not be given by an algebraic formula. For instance, consider 
the function f:Z — {0, 1}, whose rule is 


f(x) = 0 if x is even and f(x) = | if x is odd. 


If B is a set, then the function from B to B defined by the rule “map every element to 
itself” is called the identity map on B and is denoted eg. Thus eg:B — B is defined by 


p(x) = x for every xE B. 


Composition of Functions 


Let fand g be functions such that the range of fis the same as the domain of g, say 
J:B — C and g:C > D. Then the composite of f and g is the function 4:B —> D whose 
rule is 


A(x) = gC). 
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In other words, the composite function is obtained by first applying f and then 


applying g: 
pag San 
x — f(x) — (f). 


Instead of h, the usual notation for the composite function of fand g is g ° f (note the 
order). Thus, g ° f:B > D is defined by (g ° f x) = g( f(x). 


EXAMPLE 1 


Let E be the set of even integers and N the set of nonnegative integers. Let 
J-E — Z be defined by f(x) = x/2 (since x is even, x/2 is an integer). Let g:Z > N 
be given by g(n) = n?. Then the composite function g ° f:E — N has this rule: 


(g ° PX% = 8A) = gx/2) = (x/2 = x2/4. 


The composite function in the opposite order, f ° g (first apply g, then f), is not 
defined since the range of g is not the same as the domain of f. For instance, 
g(3) = 9, but the domain of f is the set of even integers; even though the rule of 
f makes sense for odd integers, f(g (3)) = f(9) = 9/2, which is not in Z. 


EXAMPLE 2 


Let fZ— Z and g:Z > Z be given by f(x) = x — 1 and g(x) = x’. Then the 
composite function f ° g:Z — Z is given by the rule 


(f° a(x) = f(g) =f?) = 7 - 1. 


In this case the composite function in the opposite order g e fis also defined; its 
tule is 


(e ef Xx) = A fd) = g(x- 1) = (x - IP = 2 - 2x + 1. 
Thus we have, for instance, 
(f° g\3)=9-1=8 but (g°f)3)=9-64+1=4. 


So even though both are defined, f © g is not the same function as g ° f. 


Two functions k:B — C and k:B-— C are said to be equal provided that 
h(b) = k(b) for every bE B. 


EXAMPLE 3 


Let f:B — C be any function and sœC — C the identity map on C. Then 
tc ° f:B > C, and for every be B 


Cece SG) = tc( F) = fl). 
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Therefore uc ° f= f. Similarly, if ¿g is the identity map on B, then f° tg:B > C, 
and for every be B 


(f° tBXb) = fe) =f). 
Consequently, 
If f:B —> C, then icof=f and for=fe 


If f:B— C, g:C — D, and h:D — E are functions, then each of the com- 
posite functions (f° g) ° hand f ° (g ° h) is a map from B to E. We claim that 


(fo s)oh= fo(gof). 


The proof of this statement is simply an exercise in using the definition of 
composite function. For each b € B 


KCS © 8) ° hb) = (f © gyh)) = f le hb) 
and 
[S° (8 ° hb) = fiC © h) (b) = f ek. 


Since the right sides of the two equalities are identical, the composite functions 
(f° g) ° hand f e (g ° h) have the same effect on each bE B, which proves the 
claim. 


Binary Operations 


Informally we can think of a binary operation on the integers, for example, as a rule 
for producing a new integer from two given ones. Ordinary addition and multiplica- 
tion are operations in this sense: Given a and b we get a + b and ab. Producing a new 
integer from a pair of given ones also suggests the idea of a function. Addition of 
integers may be thought of as the function f from Z X Z to Z whose rule is 


f(a, b)=a+b. 


Similarly, multiplication can be thought of as the function g:Z X Z — Z given by 
g(a, b) = ab. 

With the preceding examples in mind we make this formal definition. A binary 
operation on a nonempty set B (usually called simply an operation on B) is a function 
f-B X B — B. The familiar examples suggest a new notation for the general case. We 
use some symbol, say +, to denote the operation and write a + b instead of f(a, b). 


EXAMPLE 4 


As we saw above, ordinary addition and multiplication are operations on Z. 
Another operation on Z is defined by the function f:Z X Z — Z whose rule is 
f(a, b) = ab — 1. If we denote this operation by +, then 3 * 5 = 15 — 1 = 14, 
and, similarly, 


12*4=47 -7*4=—-29 0+#8=-1, 
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Note that a + b = ab — 1 = ba ~ 1 = b + a, so that the order of the elements 
doesn’t matter when applying +, as is the case with ordinary addition and 
multiplication (the technical term for this property is commutativity). On the 
other hand, 


(l#2)*#3=14#3=2 but 1*(2*3)=145=4, 


so that (a + b) + c + a * (b * c) in general. Thus ¢ is not associative as are addition 
and multiplication (meaning that (a + b) + c = a + (b + c) and (ab)c = a(be) 
always). 


EXAMPLE 5 


Let S be a nonempty set. If f:S — S and g:S — S are functions, then their 
composite f ° g is also a function from S to S. So if B is the set of all functions 
from S to S, then composition of functions is an operation on the set B. In 
other words, the map that sends (f, g) to f° g is a function from B X B to B. 
The discussion of composite functions above shows that the operation ° on B 
is associative (that is, (f° g)° A = f ° (g ° A) always) but not commutative 

(f ° g need not equal g ° f). 


Let + be an operation on a set B and CS B. The subset C is said to be closed under 
the operation + provided that 


Whenever a, bE C, then a * bEC. 


Consider, for example, the operation of ordinary multiplication on the set B of posi- 
tive real numbers. Let C be the subset of positive integers. Then C is closed under the 
operation since ab is a positive integer whenever a and b are. But when the operation 
on B is ordinary division, then C is not closed: If a and b are integers, a + b need not 
be an integer (for instance, 3 + 7 = 3/7¢C). 

If + is an operation on a set B, then B (considered as a subset of itself) is closed 
under * by the definition of an operation. Nevertheless many texts, including this 
one, routinely list the closure of B under + as one of the properties of the operation. 
Although this isn’t logically necessary, it calls your attention to the importance of 
closure and reminds you that closure cannot be taken for granted for subsets other 
than B. 


Injective and Surjective Functions 


A function f:B — C is said to be injective (or one-to-one) provided f maps distinct 
elements of B to distinct elements of C, or in functional notation: If a + b in B, then 
f(a) + f(b) in C. This rather awkward statement is equivalent to its contrapositive, so 
that we have this useful description: 


S:B — Cis injective provided that 
whenever f(a) = f(b) in C, then a = b in B. 
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EXAMPLE 6 


Let R be the set of real numbers. In order to show that the function f:R > R 
given by f(x) = 2x + 3 is injective, we assume that f(a) = f(b), that is, 


2a +3 = 2b + 3. 


Subtracting 3 from each side shows that 2a = 2b; dividing both sides by 2 we 
conclude that a = b. Therefore, fis injective. 


EXAMPLE 7 


The map f:Z — Z given by f(x) = x is not injective because we have f (—3) = 9 = 
SQ), but —3 + 3. Alternatively, the distinct elements 3 and —3 have the same image. 


A function f:B — C is said to be surjective (or onto) provided that every 
element of C is the image under fof at least one element of B, that is, 


For each cE C there exists bE B such that f(b) = c. 


EXAMPLE 8 


Let N be the set of nonnegative integers and f:Z —> N the function given by 
f(x) = |x. Then fis surjective since every element of N istheimage under f of 
at least one element of Z (namely itself). Note, however, that fis not injective 
since, for example, f(1) = f(—1). 


EXAMPLE 9 


Let E be the set of even integers and consider the map g:Z — E given by g(x) = 
4x. We claim that the element 2 in £ is not the image under g of any element 

of Z. If 2 = g(b) for some b EZ, then 2 = 48, so that 1 = 25. This is impos- 
sible since 1 is not an integer multiple of 2. Therefore, g is not surjective. Note, 
however, that g is injective since 4a = 4b (that is, g(a) = g(b)) implies that a = b. 


EXAMPLE 10 


Let R be the set of real numbers and f: R — R the function given by 

f(x) = 2x + 3. To prove that fis surjective, let cE R; we must find be R such 
that f(b) = e. In other words, we must find a number ò such that 2b + 3 = c. 
To do so, we solve this last equation for b and find b = Z, Then 


VORE 


also injective (see Example 6). 


) +3=c—3+3=c. Therefore, fis surjective. The map fis 


The preceding examples demonstrate that injectivity and surjectivity are indepen- 
dent concepts. One does not imply the other, and a particular map might have one, both, 
or neither of these properties. 
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If f:B — Cis a function, then the image of fis this subset of C: 
Im f= {e]e = f(b) for some b E B} = { f(b) | bE B}. 
For example, if f:Z — Z is given by f(x) = 2x, then Im fis the set of even integers since 
Im f = {f(x)|xeEZ} = {2x|x eZ}. Similarly, if g:Z — Z is given by g(x) = |x|, then 
Im gis the set of nonnegative integers. A map f: B — C is surjective exactly when every 
element of Cis the image of an element of B. Thus 


fB — C is surjective if and only if Im f= C. 


If f:B — C is a function and S is a subset of B, then the image of the subset S is 
the set 


F(S) = {c |c = f(b) for some bE S} = {f(b) |b ES}. 


If f:Z > Z is given by f(x) ='2x, for example, and S is the set of odd integers, then 
J(S) = {2x|x is odd} is the set of even integers that are not multiples of 4. If the subset 
S is the entire set B, then f(B) is precisely Im f. 


Bijective Functions 


A function f:B — C is bijective (or a bijection or one-to-one correspondence) provided 
that fis both injective and surjective. 


EXAMPLE 11 


Examples 6 and 10 show that the map /:R > R given by f(x) = 2x + 3is 
bijective. 


EXAMPLE 12 


The map/from the set {1, 2, 3, 4, 5} to the set {v, w, x, y, Z} given by 
M= J2-uw (=x ,O=~v 76) =2 


is easily seen to be bijective. 


The last example illustrates the fact that for any finite sets B and C, there is a bijec- 
tion from B to C if and only if B and C have the same number of elements. In par- 
ticular, if B is finiteand C ¢ B, then there cannot be a bijection from B to C. But the 
situation is quite different with infinite sets. 


EXAMPLE 13 


Let E be the set of even integers and consider the map f:Z — E given by 

f(x) = 2x. By definition every even integer is 2 times some integer, so f is surjec- 
tive. Furthermore, 2a = 2b implies that æ = b, so fis injective. Therefore, fis a 
bijection. In this case, a bit more is true. Define a map g:E — Z by g(u) = u/2; 
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this makes sense since u/2 is an integer when u is even. Consider the composite 
function go f:Z => Z: 
(8° f) = g( f(x) = g(2x) = 2x/2 = x. 


Thus (g ° f Xx) = x = ız (x) for every x, and the composite map g ° f is just the 
identity map zz on Z. Now look at the other composite, f ° g:E — E: 


(f° Du) = f (g(u)) = f(u/2) = 2(u/2) = u. 


Therefore, the composite map f ° g is the identity map iz. 


Example 13 illustrates a property that all bijective functions have, as we now prove. 


Theorem 8.1 
A function f:B — C is bijective if and only if there exists a function g:C > B 
such that 

g°f=z and fog = ig 


Proof» Assume first that fis bijective. Define g:C — B as follows. If ce C, then 
there exists b € B such that f(b) = c because f is surjective. Furthermore, 
since fis also injective, there is only one element > such that f(b) = c (for 
if f(b") = c, then f(b) = f(b’) implies b = b’). So we can define a function 
g:C — B by this rule: 


g(c) = b, where b is the unique element of B such that f(b) = c. 
Then g(c) = b exactly when f (b) = c. Thus for any ce C 
(f° sXe) = f(g) = f(b) = c, 


from which we conclude that f° g = uc. Similarly, for each u E B, f(u) is 
an element of C, say f(u) = v, and, hence, by the definition of g, we have 
g(v) = u. Therefore, 


E ° f)(u) = (fu) = gv) =u 


and g ° f = ıp. This proves the first half of our biconditional theorem. 
To prove the other half, we assume that a map g:C > B with the 
stated properties is given. We must show that fis bijective. Suppose f(a) = 


f(b). Then 
E) = a(f(0)) 
(e ° fa) = (g ° f Xb) 
ta (a) = eafb) 
a=b. 


Ee pe ERS ROE NP te ah Ar a rE E 


Functions 519 


Therefore, f(a) = f(b) implies a = b, and fis injective. To show that fis 
surjective, let c be any element of C. Then g(c) € Band f(g(c)) = 

(f° g)(c) = iele) = c. So we have found an element of B that f maps 
onto c (namely g(c)); hence, f is surjective. Therefore, f is bijective, and 
the theorem is proved. E 


If f:B — C isa bijection, then the map g in Theorem B.1 is called the 
inverse of f and is sometimes denoted by f~!. Reversing the roles of f 
and gin Theorem B.1 shows that the inverse map g of a bijection f is 

itself a bijection. 


E Exercises 


NOTE: Z is the set of integers, Q the set of rational numbers, and R the set of real 
numbers. 


A. 1. Describe each set by listing: 
(a) The integers strictly between —3 and 9. 
(b) The negative integers greater than —10. 
(c) The positive integers whose square roots are less than or equal to 4. 
2. Describe each set in set-builder notation: 
(a) All positive real numbers. 
(b) All negative irrational numbers. 
(c) All points in the coordinate plane with rational first coordinate. 
(d) All negative even integers greater than — 50. 
3. Which of the following sets are nonempty? 
(a) {rEQ|r’ = 2} 
(b) {rER|r° + 5r — 7 = 0} 
(c) {tEZ|6r? -t-1=0} 
4. Is Ba subset of C when 
(a) B= Zand C = Q? 
(b) B = all solutions of xê + 2x — 5 = 0 and C = Z? 
(c) B= {a, b, 7,9, 11, —6} and C = Q? 
5. If AS Band BSC, prove that ACC. 
6. In each part find B — C, B N C, and B U C: 
(a) B=Z,C=Q. (@)B=R,C=Q. 
(c) B= {a, b,c, 1, 2, 3, 4, 5}, C = {a, c, e, 2, 4, 6, 8}. 
7. List the elements of B x C when B = {a, b, c} and C = {0, 1, c}. 
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8. List the elements of A X B X C when A = {0, 1} and B, Care as in Exercise 7. 
9. Let A = {1,2, 3, 4}. Exhibit functions fand g from A to A such that fo g # g o f. 


10. 
11. 


12. 


13. 


14. 


17. 
18. 


19. 


Do Exercise 9 when A = Z. 

Is the subset B closed under the given operation? 

(a) B = even integers; operation: multiplication in Z. 
(b) B = odd integers; operation: addition in Z. 


(c) B = nonzero rational numbers; operation: division in the set of nonzero 
real numbers. 


(d) B = odd integers; operation + on Z, where a + b is defined to be the 
number ab — (a + b) + 2. 


Find the image of the function f when 

(a) FRR f(x) =. 

(b) f:Z> Q; f(x) = x- 1. 

(c) ffR > Rs f(x) = —x7 +1. 

Let B = {1, 2, 3, 4} and C = {a, b, c}. 

(a) List four different surjective functions from B to C. 

(b) List four different injective functions from C to B. 

(c) List all bijective functions from C to C. 

(a) Give an example of a function f that is injective but not surjective. 


(b) Give an example of a function g that is surjective but not injective. 


. Let B and C be nonempty sets. Prove that the function 


fBxC—CxB 
given by f(x, y) = (y, x) is a bijection. 


. List all the subsets of {1, 2}. Do the same for {1, 2, 3} and {1, 2, 3, 4}. Make 


a conjecture as to the number of subsets of an n-element set. [Don’t forget the 
empty set.] 


Verify each of the properties of sets listed on page 511. 


If a,b ER with a < b, then the set {rER |a = r < b} is denoted [a, b). Let N 
denote the nonnegative integers and P the positive integers. Find these unions 
and intersections: 


a Unaan apea 0) 
neN n 


neP 
(b) Uaes) (d) Aasa) 
neP[N n neP |n n 
Prove that for any sets A, B, C: 
AX(BUC)=(AX B)U(AXC) 
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20. Let A, B be subsets of U. Prove De Morgan’s laws: 
(a) U- (ANB) =(U—- A)U(U— B) 
b) U- (AUB) =(U- A) N(U- B) 
21. Prove that for any sets A, B, C: 
(A ~- B)U(B- A)=(AU B) - (ANB) 
22. If Cis a finite set, then |C| denotes the number of elements in C. If A and 
Bare finite sets, is it true that |4 U B| = |A| + |B}? 


23. Let R** denote the positive real numbers. Does the following rule define a 
function from R** to R: assign to each positive real number c the real number 
whose square is c? 


24. Determine whether the given operation on R is commutative (that is, a + b = 
b x a for all a, b) or associative (that is, a + (b + c) = (a * b) * c for all a, b,c). 


(a) a + b = 2® (b) a+b = ab? 
(c) at b=0 (d) a+ b = (a + by/2 
(e) ax b=] (f) ae#b=b 


(g:) atb=a +b 
25. Prove that the given function is injective. 
(a) Z> Z; f(x) = 
b) FR >R; f(x) = x? 
(c) £Z >Q; f(x) = x/7 
(d) AR > R; A(x) = -3x + 5 
26. Prove that the given function is surjective. 
(a) R> R; f(x) = 
(b) [Z >Z; f(x) =x-4 
(c) SR >R; f(x) = -3x + 5 
(d) f:Z X Z> Q; f(a, b) = alb when b + 0 and 0 when b = 0. 
27. Let f:B — C and g:C > D be functions. Prove: 
(a) If fand g are injective, then g ° f:B —> D is injective. 
(b) If fand g are surjective, then g fis surjective. 


28. (a) Let f:B — C and g:C > D be functions such that g e f is injective. Prove 
that fis injective. 


(b) Give an example of the situation in part (a) in which g is not injective. 


29. (a) Let f:B — Cand g:C—> D be functions such that g ° f is surjective. Prove 
that g is surjective. 


(b) Give an example of the situation in part (a) in which fis not surjective. 
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30. Let g:B X C—> C (with B # Ø ) be the function given by g (x, y) = y. 
(a) Prove that g is surjective. 
(b) Under what conditions, if any, is g injective? 


31. If f:B — C is a function, then f can be considered as a map from B to Im f 
since f (b) € Im f for every b E B. Show that the map f:B > Im f is surjective. 


32. Let B be a finite set and f:B > B is a function. Prove that f is injective if and 
only if fis surjective. 


33. Let f:B — C bea function and let S, T be subsets of B. 
(a) Prove that f(S U T) = f(S) U f(T). 
(b) Prove that (S N T)Cf(S) A KT). 
(c) Give an example where (S N T) + KS) N f(T). 


34. Prove that f:B — C is injective if and only if (S N T) = A(S) N f(T) for every 
pair of subsets S, T of B. 


35. Let f:B > Cand g:C > D be bijective functions. Then the composite function 
g°f:B— D is bijective by Exercise 27. Prove that (g o fy} = f'o g`! 
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Well Ordering and Induction 


We assume that you are familiar with ordinary arithmetic in the set Z of integers and 
with the usual order relation (<) on Z. The subset of nonnegative integers will be 
denoted by N. Thus 


N = {0, 1, 2,3,...}. 


Finally, we assume this fundamental axiom: 


WELL-ORDERING AXIOM Ervery nonempty subset of N contains a smallest 
element. 

Most people find this axiom quite plausible, but it is important to note that it 
may not hold if N is replaced by some other set of numbers; see page 3 of the text for 
examples. 

An important consequence of the Well-Ordering Axiom is the method of proof 
known as mathematical induction. It can be used to prove statements such as 


A set of n elements has 2" subsets. 


Denote this statement by the symbol P(n) and observe that there are really infinitely 
many statements, one for each possible value of x: 


P(0): A set of 0 elements has 2° = 1 subset. 
P(): A set of 1 element has 2! = 2 subsets. 
P(2): A set of 2 elements has 2? = 4 subsets. 
P(3): A set of 3 elements has 23 = 8 subsets. 
And so on. To prove the original proposition we must prove that 
P(n) is a true statement for every n EÑN. 


Here’s how it can be done. 
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Theorem C.1 The Principle of Mathematical Induction 


Assume that for each nonnegative integer n, a statement P(n) is given. If 


(i) P(0) is a true statement; and 
(ii) Whenever P(A) is a true statement, then PK + 1) is also true, 


then P(n) is a true statement for every n EN. 


The example of the number of subsets of a set of n elements is continued after the 
proof of the theorem. You may want to read that example now to see how Theorem C. 1 
is applied, which is quite different from the manner in which it is proved. 


Proof of Theorem 6.1 > Lets be the subset of N consisting of those integers j 
for which P( f) is false. To prove the theorem we need only show that 
Sis empty; we shall use proof by contradiction to do this. Suppose S 
is nonempty. Then by the Well-Ordering Axiom, S contains a smallest 
element, say d. Since P(d) is false by the definition of S and P(0) is true 
by property (i), we must have d # 0. Consequently, d = 1 (because dis a 
nonnegative integer), and, hence, d — 1 = 0, that is, d — 1 EN. Since 
d—1<dand dis the smallest element in S, d — 1 cannot be in S. 
Therefore, P(d — 1) must be wue (otherwise d — 1 would be in S). 
Property (ii) (with k = d — 1) implies that P((d — 1) + 1) = P(d) is also 
a true statement. This is a contradiction since d€ S. Therefore, S is the 
empty set, and the theorem is proved. W 


In order to apply the Principle of Mathematical Induction to a series of state- 
ments, you must verify that these statements satisfy both properties (i) and (ii). Note 
that property (ii) does not assert that any particular P(k) is actually true, but only that 
a conditional relationship holds: If P(k) is true, then P(k + 1) must also be true. So to 
verify property (ii), you assume the truth of P(X) and use this assumption to prove that 
P(k + 1) is true. As we shall see in the examples below, it is often possible to prove this 
conditional statement even though you may not be able to prove directly that a particu- 
lar P( J) is true. The assumption that P(k) is true is called the induction assumption or 
the induction hypothesis. 

You may have seen induction used to prove statements such as “the sum 


a i +1 
of the first n nonnegative integers is mae ix 


n(n+ 1)” = 
“O+1+24+3+-:-+n — = Although such examples make nice exercises 


here P(n) is the statement: 


for beginners, they are not typical of the way induction is used in advanced math- 
ematics. The examples below will give you a more comprehensive picture of inductive 
proof. They are a bit more complicated than the usual elementary examples but are 
well within your reach. 
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EXAMPLE 1 


We shall use the Principle of Mathematical Induction to prove that for each n = 0, 
A set of n elements has 2" subsets. 


If n = 0, then the set must be the empty set (the only set with no elements). Its one and 
only subset is itself (since Ø is a subset of every set). So the statement 


P(0): A set of 0 elements has 2° = 1 subset 


is true (property (i) holds). 
In order to venfy property (ii) of Theorem C. 1, we assume the truth of 


P(k): A set of k elements has 2* subsets 
and use this induction hypothesis to prove 
P(k + 1): A set of k + 1 elements has 2**? subsets. 


To do this, let T be any set of k + 1 elements and choose some element c of T. Every 
subset of T either contains c or does not contain c. The subsets of T that do not 
contain c are precisely the subsets of T — {c}. Since the set T — {c} has one fewer 
element than T, it is a set of k elements and, therefore, has exactly subsets (becamse 
the induction hypothesis P(k) is assumed true). Now every subset of T that contains 
cmust be of the form {c} U D, where D isa subset of T — {c}. There are 2* possible 
choices for D and, hence, * subsets of T that contain c. Consequently, the total num- 
ber of subsets of Tis 

gins of ee) ‘eee es of subsets y ee 


that contain ¢ do not contain c 


Thus any set Tof k + 1 elements has 2** subsets, that is, P(k + 1) is a 

true statement. We have now verified property (ii) and can, therefore, apply 
Theorem C.l to conclude that P(n) is true for every n € N; that is, every set of n 
elements has 2” subsets. 


The Principle of Mathematical Induction cannot be conveniently used on certain 
propositions, eyen though they appear to be suitable for inductive proof. In such cases 
a variation on the procedure is needed: 


Theorem 6.2 The Principle of Complete Induction 


Assume that for each nonnegative integer n, a statement P(n} is given. If 


(i) P(0) is a true statement; and 


(ii) Whenever P(/) is a true statement for all j such that 0 = j < £, then 
P(t) is also true, 


then P(n} is a true statement for every n EN. 
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Although commonly used, the title “complete induction” is a bit of a misnomer 
since, as we shall see, this form of induction is equivalent to the previous one. 


Proof of Theorem 0.2» For each ne N, let O(n) be the statement 
P( J) is true for all J such that 0 = J = n. 


Note carefully that the last inequality sign in this statement is = and not <. 
We shall use the Principle of Mathematical Induction (Theorem C. 1) to show 
that O(n) is true for every n E N. This will mean, in particular, that P(n) is true 
for every n € N. Now Q(0) is the statement 


P( J) is true for all j such that 0 = j = 0. 


In other words, Q(0) is just the statement “P(0) is true”. But we know 
that this is the case by hypothesis (i) in the theorem. Suppose that Q(k) is 
true, that is, 


P( J) is true for all J such that 0 sj sk. 


By hypothesis (ii) (with t = k + 1), we conclude the P(k + 1) is also true. 
Therefore, P( j) is true for all j such that 0 =J = k + 1, that is, O(k + 1) 
is a true statement. Thus we have shown that whenever Q(k) is true, then 
Q(k + 1) is also true. By the Principle of Mathematical Induction, Q(x) 
is true for every nEN, and the proof is complete. E 


In the formal description of induction (either principle), the notation P(n) is quite 
convenient. But it is rarely used in actual proofs by induction. The next example is 
more typical of the way inductive proofs are usually phrased. But even here we include 
more detail than is customary in such proofs. 


EXAMPLE 2 


We shall use the Principle of Complete Induction to prove: 
If n,b EN and b > 0, then there exist q, rE N such that 
(+) n=bą+r and Osr<b. 


This statement (called the Division Algorithm for nonnegative integers) is just a 
formalization of grade-school long division: When z is divided by b, there is a quotient 
qand remainder r (smaller than the divisor 5) such that n = bq + r; see the discussion 
on page 4 of the text. 

Statement (+) is true for n = 0 and any positive b (let g = 0 and r = 0). So property 
G) of Theorem C.2 holds. Suppose that (+) is true for all z such that 0 = n < t (this is the 
induction hypothesis). We must show that (+) is true for n = t. If t < b, then t = 50 + t, 
so (+) is true with q = O and r = t. If b = ż, then 0 = ¢ — b < t, and by the induction 
hypothesis, (+) is true for n = t — b. Therefore, there exist integers q, and 7, such that 


t-b=qbt+n and 0sr<5. 
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Consequently, 
t=b+qbt+n=A+q)o+n and Os7,<b5, 


Therefore, (+) is true for n = t (with q = 1 + q, and r = r,). Hence, property 
(ii) of Theorem C.2 is satisfied. By the Principle of Complete Induction, (+) is 
true for every nEN. 


Some mathematical statements are false (or undefined) for n = 0 or other small 
values of n but are true for n = r and all subsequent integers. For instance, it can be 
shown that 

3n >n + 1 for every integer n = 1. 
2" > n + 2 for every integer n = 5. 
Such statements can often be proved by using a variation of mathematical induction 
(either principle): 
In order to prove that statement Pà») is true for each integer # = r, 


follow the same basic procedure as before, 
starting with P(r) instead of P(0). 


The validity of this procedure is a consequence of 


Theorem 6.3 


Let r be a positive integer and assume that for each n = r a statement P(n) 
is given. If 


(i) P(r) is a true statement; 
and either 
(ii) Whenever k= rand P(k) is true, then P(k + 1) is true; 


or 


{ii} Whenever P(/) is true for ali j such that r = j < t, then P(é) is true, 
then P(n) is true for every n =r. 


Proof > Conditions (i) and (ii) are the analogue of Theorem C. 1. Verify that 
the proof of Theorem C.l. carries over to the present case verbatim if 
0 is replaced byr, 1 by r + 1, and N by the set N, = {n |nEN anda = r}. 
Conditions (i) and (ii^) are the analogue of Theorem C.2; its proof 
carries over similarly. M 


The final theorem to be proved here is not necessary in order to read the rest of 
the book. But it is a result that every serious mathematics student ought to know. 
It is also a good illustration of the fact that intuition can sometimes be misleading. 
Most people feel that the Well-Ordering Axiom is obvious, whereas the Principle of 
Complete Induction seems deeper and in need of some proof. But as we shall now see, 
these two statements are actually equivalent. Among other things, this suggests that 
the Well-Ordering Axiom is a good deal deeper than it first appears. 
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Theorem C.4 


The following statements are equivalent: 


(1) The Well-Ordering Axiom. 
(2) The Principle of Mathematical Induction. 
(3) The Principle of Complete Induction. 


Proof The proof of Theorem C.l shows that (1) = (2), and the proof of 
Theorem C.2 shows that (2) = (3). To prove (3) = (1), we assume the 
Principle of Complete Induction and let S be any subset of N. To prove 
that the Well-Ordering Axiom holds, we must show 


If S is nonempty, then S has a smallest element. 
To do so, we shall prove the equivalent contrapositive statement 
If S has no smallest element, then S is empty. 


Assume S has no smallest element; to prove that S is empty we need only 
show that the following statement is true for every n EN: 


(++) n is not an element of S. 


Since 0 is the smallest element of N, it is also the smallest element of any 
subset of N containing 0. Since S has no smallest element, 0 cannot be 
in S, and, hence, (++) is true when n = 0 (property (i) of Theorem C.2 
holds). Suppose (*#) is true for all j such that 0 = f < t. Then none of 
the integers 0, 1,2,..., £ — 1 is in S, or equivalently, every element in 
S must be greater than or equal to t. If t were in S, then t would be the 
smallest element in S since s = t for all s € S. Since S has no smallest 
element, f is not in S. In other words, (**) is true when n = t. Thus 

the truth of (#*) when j < t implies its truth for t (property (ii) of 
Theorem C.2 holds). By the Principle of Complete Induction, (++) is 
true for all n€ N. Therefore, S is empty, and the proof is complete. W 


E Exercises 


A. 1. Prove that the sum of the first n nonnegative integers is n(n + 1)/2. 
[Hint: Let P(k) be the statement: l 


O+1+2+-:-+k= kk + 1)/2] 
2. Prove that for each nonnegative integer n, 2" > n. 


3. Prove that 2*-! = n! for every nonnegative integer n. [Recall that 0! = 1 and 
forn > 0,nt=1+2-3---(n— In] 


4. Let r be a real number, r # 1. Prove that for every integer n = 1, 


"i 
ert eP a ee 
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B. 5. Prove that 4 is a factor of 7" — 3” for every positive integer n. 
[Hint: Tt! — 3+! = JAH — 7. 3h 4.7 - 3 — 3! = 77 — 3) + (7 ~ 3)3*] 


6. Prove that 3 is a factor of 4" — 1 for every positive integer n. 

7. Prove that 3 is a factor of 2"*! + 1 for every positive integer n. 

8. Prove that 5 is a factor of 2“~? + 1 for every positive integer n. 

9. Prove that 64 is a factor of 9" — 8n — 1 for every nonnegative integer n. 


10. Use the Principle of Complete Induction to show that every integer greater 
than | is a product of primes. [Recall that a positive integer p is prime 
provided that p > 1 and that the only positive integer factors of pare 1 
and p.] 


11. Let B bea set of elements, Prove that the number of different injective 
functions from B to B is n!. [n! was defined in Exercise 3.] 


12. True or false: n? — n + 11 is prime for every nonnegative integer n. Justify 
your answer. [Primes were defined in Exercise 10.] 


13. Let B be a set of n elements. 
(a) If n = 2, prove that the number of two-clement subsets of B is n(n — 1)/2. 
(b) If n = 3, prove that the number of three-element subsets of Bis n(n — I)(n — 2)/31. 


(c) Make a conjecture as to the number of k-clement subsets of B when n = k. 
Prove your conjecture. 


14. At a social bridge party every couple plays every other couple exactly once. 
Assume there are no ties. 


(a) If n couples participate, prove that there is a “best couple” in the following 
sense: A couple u is “best” provided that for every couple v, u beats v or u 
beats a couple that beats v. 


(b) Show by example that there may be more than one best couple. 


15. What is wrong with the following “proof” that all roses are the same color. 
It suffices to prove the statement: In every set of n roses, all the roses in 
the set are the same color. If n = 1, the statement is certainly true. Assume 
the statement is true for n = k. Let S be a set of k + 1 roses. Remove one 
rose (call it rose A) from S; there are k roses remaining, and they must all 
be the same color by the induction hypothesis. Replace rose A and remove 
a different rose (call it rose B). Once again there are k roses remaining that 
must all be the same color by the induction hypothesis. Since the remaining 
roses include rose A, all the roses in S have the same color. This proves that 
the statement is true when n = k + 1. Therefore, the statement is true for all 
n by induction. 
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16. Let n be a positive integer. Suppose that there are three pegs and on one of 
them z rings are stacked, with each ring being smaller in diameter than the 
one below it, as shown here for n = 5: 


The game is to transfer all the rings to another peg according to these rules: 
(i) only one ring may be moved at a time; (ii) a ring may be moved to any 

peg but may never be placed on top of asmaller ring; (iii) the final order of 
the rings on the new peg must be the same as their original order on the first 
peg. Prove that the game can be completed in 2" — 1 moves and cannot be 
completed in fewer moves. 


17. Let x be a real number greater than —1. Prove that for every positive integer n, 
(L +x = 1+ nx. 


C. 18. Consider maps in the plane formed by drawing a finite number of straight lines 
(entire lines, not line segments). Use induction to prove that every such map 
may be colored with just two colors in such a way that any two regions with 
the same line segment as a common border have different colors. Two regions 
that have only a single point on their common border may have the same color. 
[This problem is a special case of the so-called Four-Color Theorem, which 
states that every map in the plane (with any continuous curves or segments of 
curves as boundaries) can be colored with at most four colors in such a way 
that any two regions that share a common border have different colors.] 
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APPENDIX D 


Equivalence Relations 


This appendix may be read anytime after you’ve finished Appendix B, but it is not 
needed in the text until Section 10.4. If you read it before that point, you should 
have no trouble with Examples 1-3 but may have to skip some of the later examples. 
Chapter 2 is a prerequisite for the examples labeled “integers”, Chapter 6 for those 
labeled “rings”, and Section 8.1 for those labeled “groups”. 

If A is a set, then any subset of A X A is called a relation on 4. A relation Ton A 
is called an equivalence relation provided that the subset T is 


(i) Reflexive: (a, a)€ T for every ac A. 
(ii) Symmetric: If (a, b) € T, then (b, a) E T. 
(iii) Transitive: If (a, b)€ T and (b, c)e€ T, then (a, c) E T. 
If T is an equivalence relation on A and (a, b) € T, we say that a is equivalent to b and 
write a ~ b instead of (a, b) E T. In this notation, the conditions defining an equiva- 
lence relation become 
(i) Reflexive: a ~ a for every ac A. 
(ii) Symmetric: If a ~ b, then b ~ a. 
(iit) Transitive: If a ~ band b ~ c, then a ~ c. 


When this notation is used, the relation is usually defined without explicit reference to 
a subset of A X A. 


EXAMPLE 1 


Let A be a set and define a ~ b to mean a = b. In other words, the equivalence 
relation on A is the subset T = {(a, b) |a = b} of A X A. Then it is easy to see 
that ~ is an equivalence relation. 


EXAMPLE 2 


The relation on the set R of real numbers defined by 
r ~ s means |r| = |s| 
is an equivalence relation, as you can readily verify. 
531 
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EXAMPLE 3* 


Define a relation on the set Z of integers by 
a ~ b means a— b is a multiple of 3. 


For example, 17 ~ 5 since 17 — 5 = 12, a multiple of 3. Clearly a ~ a for every 
a since a — a = 0 = 3-0. To prove property (ii), suppose a ~ b. Then a —b is 

a multiple of 3. Hence, — (a — b) is also a multiple of 3. But — (a —8) = b-a. 
Therefore, b ~ a. To prove property (iii), suppose a ~ b and b ~ è. Then æ —b 
and b — care multiples of 3 and so is their difference (a — b) — (b — ¢) = a- e, 
so that a ~ e. Thus ~ is an equivalence relation (usually called congruence 
modulo 3 and denoted a = b (mod 3)). 


EXAMPLE 4 (INTEGERS) 


If n is a fixed positive integer, the relation of congruence modulo n on the set Z, 
defined by 


a = b (mod n) if and only if a —b is a multiple of n, 


is an equivalence relation by Theorem 2.1. 


EXAMPLE 5 (RINGS) 


If J is an ideal in the ring R, then the relation of congruence modulo 7, defined 
by 


a = b (mod J) if and only if a-b E J, 


is an equivalence relation on R by Theorem 6.4. 


EXAMPLE 6 (GROUPS) 


If K is a subgroup of a group G, then the relation defined by 
a= bif and only if ab! € K 


is an equivalence relation on G by Theorem 8.1. 


Caution It is quite possible to have a relation on a set that satisfies one or two, but 
not all three, of the properties that define an equivalence relation. For instance, the 
order relation = on the set R of real numbers is reflexive and transitive but not sym- 
metric; for other examples, see Exercises 8 and 9. Therefore, you must verify all three 
properties in order to prove that a particular relation is actually an equivalence relation. 


*if you've already read Section 2.1, skip Examples 3 and 8; it’s just congruence modulo n when n = 3. 
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Let ~ be an equivalence relation on a set A. If a E A, then the equivalence class of 
a (denoted [a]) is the set of all elements in A that are equivalent to a, that is, 


[a] = {b|b € A andb ~ a}. 
In Example 2, for instance, the equivalence class [9] of the number 9 consists of all real 
numbers b such that b ~ 9, that is, all numbers b such that |b| = |9} Thus [9] = {9, —9}. 


EXAMPLE 7 (RINGS, GROUPS) 


If Z is an ideal in a ring R, then an equivalence class under the relation of con- 

gruence modulo Tis a coset a + I= {a + i| i€ I}. Similarly, if K is a subgroup 
of a group G, then an equivalence class of the relation congruence modulo K is 
aright coset Ka = {ka|keE K}. 


EXAMPLE 8 


In Example 3, the equivalence class of the integer 2 consists of all integers b 
such that b ~ 2, that is, all b such that b — 2 is a multiple of 3. But b — 2 is 
a multiple of 3 exactly when 4 is of the form b = 2 + 3k for some integer k. 
Therefore, 


[2] = {2 + 3k | kKEZ} = {24+0,2+3,246,249,..3 
SEE oxigen Lp Dee 8 Mea 
A similar argument shows that the equivalence class [8] consists of all integers 
of the form 8 + 3k (k € Z); consequently, 
[8] = {..., —7, —4, —1, 2, 5, 8, 11, 14, 17, ...}. 
Thus [2] and [8] are the same set. Note that 2 ~ 8. This is an example of 


Theorem 0.1 


Let ~ be an equivalence relation on a set A and a, b EA. Then 


a~ cif and only if [a] = [c]. 


Proof* > Assume a ~ c. To prove that [a] = [c], we first show that [a] & [c]. To do 
this, let b € [a]. Then b ~ a by definition. Since a ~ c, we have b ~ c by 
transitivity. Therefore, b € [c] and [a] & [c]. Reversing the roles of a 
and c in this argument and using the fact that c ~ a by symmetry, show 
that [c] = [a]. Therefore, [æ] = [c]. Conversely, assume that [a] = [c]. Since 
a ~ a by reflexivity, we have a E [a], and, hence, a € [c]. The definition of 
[c] shows thata ~ c. M 


“If you've read Section 2.1, note that this proof and the proot of Corollary D.2 are virtually identical to 
the proofs of Theorem 2.3 and Corollary 2.4: just replace = by ~. 
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Generally when one has two sets, there are three possibilities: The sets are equal, 
the sets are disjoint, or the sets have some (but not all) elements in common. With 
equivalence classes, the third possibility cannot occur: 


Corollary 0.2 


Let ~ be an equivalence relation on a set A. Then any two equivalence 
classes are either disjoint or identical. 


Proof» Let [a] and [c] be equivalence classes. If they are disjoint, then there is 
nothing to prove. If they are not disjoint, then [a] N [cd] is nonempty, and 
by definition there is an element b such that b E [a] and bE [c]. By the 
definition of equivalence class, b ~ a and b ~ c. Consequently, by transi- 
tivity and symmetry, a ~ c. Therefore, [a] = [c] by Theorem D.I. M 


A partition of a set 4 is a collection of nonempty, mutually disjoint* subsets of 4 
whose union is 4. Every equivalence relation ~ on A leads to a partition as follows. 
Since a € [a] for each a E A, every equivalence class is nonempty, and every element of 
A is in one. Distinct equivalence classes are disjoint by Corollary D.2. Therefore, 


The distinct equivalence classes of an equivalence 
relation on a set A form a partition of A. 


Conversely, every partition of A leads to an equivalence relation whose equivalence 
classes are precisely the subsets of the partition (Exercise 21). 


E Exercises 


A. 1. Let P bea plane. If p, q are points in P, then p ~ q means p and q are the same 
distance from the origin. Prove that ~ is an equivalence relation on P. 


2. Define a relation on the set Q of rational numbers by: r ~ s if and only if 
r — s EŻ. Prove that ~ is an equivalence relation. 


3. (a) Prove that the following relation on the set R of real numbers is an 
equivalence relation: a ~ b if and only if cos a = cos b. 


(b) Describe the equivalence class of 0 and the equivalence class of 77/2. 


4. If mand x are lines in a plane P, define m ~ n to mean that m and z are 
parallel. Is ~ an equivalence relation on P? 


5. (a) Let ~ be the relation on the ordinary coordinate plane defined by 
(x, y) ~ (u, v) if and only if x = u. Prove that ~ is an equivalence relation. 


(b) Describe the equivalence classes of this relation. 


*Thatis, any two of the subsets are disjoint. 
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6. Prove that the following relation on the coordinate plane is an equivalence 
relation: (x, y) ~ (u, v) if and only if x — u is an integer. 


7. Let f:A — B bea function. Prove that the following relation is an equivalence 
relation of 4: u ~ vif and only if f(u) = f(v). 


8. Let 4 = {1, 2, 3}. Use the ordered-pair definition of a relation to exhibit a 
relation on A with the stated properties. 


(a) Reflexive, not symmetric, not transitive. 
(b) Symmetric, not reflexive, not transitive. 
(c) Transitive, not reflexive, not symmetric. 
(d) Reflexive and symmetric, not transitive. 
(e) Reflexive and transitive, not symmetric. 
(f) Symmetric and transitive, not reflexive. 


9. Which of the properties (reflexive, symmetric, transitive) does the given 
relation have? 


(a) a < bon the set R of real numbers. 
(b) AS Bon the set of all subsets of a set S. 
(©) a +b on the set R of real numbers. 
(d) (—1)° = (-1)’ on the set Z of integers. 
B. 10. If ris a real number, then fr] denotes the largest integer that is = r; for 


instance [r] = 3, [7] = 7 and [—1.5] = —2. Prove that the following relation is 
an equivalence relation on R: r ~ s if and only if [r] = [s]. 


11. Let ~ be defined on the set R* of nonzero real numbers by: a ~ b if and only 
if a/b € Q. Prove that ~ is an equivalence relation. 


12. Is the following relation an equivalence relation on R: a ~ b if and only if 
there exists k EZ such that a = 10%. 


13. In the set R[x] of all polynomials with real coefficients, define f(x) ~ g(x) if 
and only if f'(x) = g'(x), where’ denotes the derivative. Prove that ~ is an 
equivalence relation on R[x]. 

14. Let T be the set of all continuous functions from R to R and define f ~ g if 
and only if f(2) = g(2). Prove that ~ is an equivalence relation. 

15. Prove that the relation on Z defined by a ~ b if and only if a? = b? (mod 6) is 
an equivalence relation. 

16. Let S= {(@, b) |a, bEZ and b + 0} and define (a, b) ~ (c, d) if and only if 
ad = be. Prove that ~ is an equivalence relation on S. 

17. Let ~ be a symmetric and transitive relation on a set A. What is wrong 
with the following “proof” that ~ is reflexive: a ~ b implies b ~ a by 
symmetry; then a ~ b and b ~ aimplya ~ a by transitivity. [Also see 
Exercise 8(f).] 
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18.* Let G be a group and define a ~ b if and only if there exists c E€ G such that 
b = "lac. Prove that ~ is an equivalence relation on G. 


19.* (a) Let K be a subgroup of a group G and define a ~ b if and only if 
abe K. Prove that ~ is an equivalence relation on G. 


(b) Give an example to show that the equivalence relation in part (a) need 
not be the same as the relation in Example 6. 


20.* Let G be a subgroup of S,. Define.a relation on the set {1, 2, . . . , n} by 
a~ bif and only if a = a(b) for some ø in G. Prove that ~ is an equivalence 
relation. 


21. Let A beaset and {4,| i €T} a partition of A. Define a relation on A by: 
a~ b if and only if a and b are in the same subset of the partition (that is, 
there exists k EJ such that a E€ A, and b E A,). 


(a) Prove that ~ is an equivalence relation on A. 


(b) Prove that the equivalence classes of ~ are precisely the subsets A, of the 
partition. 


*Sections 7.2 and 7.3 are prerequisites for Exercises 18-20. 
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APPENDIX £E 


The Binomial Theorem 


Appendix C and Section 3.2 are the prerequisites for this appendix. The material 
presented here is used in Section 11.6 and in occasional exercises elsewhere. 
As we saw in Example 3 of Section 3.2, 


(a + bY = @ + 2ab + b? 


for any elements a, b in a commutative ring R. Similar calculations using distributivity 
and commutative multiplication show that 


(a + bY = @ + 3a°b + 3al? + b? 
(a + by = at + 4b + 6a?" + 4ab + Bt. 


There is a pattern emerging here, but it may not be obvious unless certain facts are 
pointed out first. 

Recall that 0! is defined to be 1 and that for each positive integer n, the symbol n! 
denotes the number n(n—1)(n—2)-++3-2-1. a a k, with 0 = k = n, the binomial 
coefficient (d is defined to be the number = This number may appear to 
be a fraction, but every binomial coefficient is actually an integer (Exercise 6). For 
: 4 4! _ Ae Zien2i ll at 4 4! 
instance, (i) = I4- Di 1-3-2:1 4, and similarly, (3) z221 = 6. Note 
that these numbers appear as coefficients in the preceding expansion of (a + b}; in 
fact, youcan readily verify that 


(a + bf = at + (e + (er + (r + BF, 


This is an example of 
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Theorem E.1 The Binomial Theorem 


Let R be a commutative ring and a, bE R. Then for each positive integer n, 
Pores n =t DY an-z on pn 
(a+b) =a + ype + 2] tee + n — 1)" +b. 


Proof > The proof is by induction on n. If n = 1, the theorem states that 


(a + b)! = a! + b’, which is certainly true. Assume that the theorem is 
true when n = k, that is, that 


(a+ aha da (Dew ee (Bate toot (eE j+ 


We must use this assumption to prove that the theorem is true when n = k + 1. By the 
definition of exponents (a + byt! = (a + bX(a + b¥.. Applying the induction hypoth- 
esis to (a + 5)* and using distributivity and commutative multiplication, we have 


(a+ b)**! = (a + b)(a + b)* 


= (a+ ats (Thee +4 (w+ ‘ Jø: +t] 
1 r k-1 
= k =l eee k —r pr tids & j! | 
a| + (7) b+ + (Pat BP + sf ae Jatt re 
+a + (ea (Tere is Ja + | 
1 NG kel 
= [ates + (Tato +--+ (Perey 4+ ( k e + alt 
1 x k-1 
+ [as + (ew ++ (e es ( 5 Jar + wt] 
1 r k-1 
=at! + e + 1|» + G + (e +e 
k k kept eer k +1 
+ (Gane) 3 Brg + [r+(,* 1) ja + i 
Exercise 5 (which you should do) shows that for r = 0, 1,...,k 
( k 2 (*) = H + i) 
r+i r) \et+ tf 
Apply this fact to each of the coefficients in the last part of the equation above. 
For instance, (‘) +1= (‘) + (5) = 9 nl f and (3) + (5) = 4 > F and 
1 1 0 Lo 2 1 a7 
so on. Then, from the first and last parts of the equation above we have 
(a + byt = okt + i $ "ee + y A yg + 
2. fF 


a al ant (FF Nak + oe 
r+1 k 
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Therefore, the theorem is true when n = k + 1, and, hence, by induction it is true for 
every positive integer n. E 


E Exercises 


A. 1. Let x and y be real numbers. Find the coefficient of x*y' in the expansion of 
(2x — y)’. [Hint: Apply Theorem E.l with a = 2x, b = y*] 


2. If x and y are real numbers, what is the coefficient of x!y* in the expansion of 


(x3 — 3y)! 
B. 3. Let r and n be integers with 0 < r < n. Prove that A = (, g ) 


4. Prove that for any positive integer n, 2” = (6) + @) F G Hp sere E 
[Hint: 2" = (1 + 1)"] 0 N 2 
; k k 
5. Let rand k be integers such that 0 = r = k — 1. Prove that Fh F = 


k+1 i 
. [Hint: Use the fact that 
r+1 


(kK-nk-(r+D= 4-H C+D - +) 
to express each term on the left as a fraction with denominator (k + I)\(k — n)!. Add 
the fractions, simplify the numerator, and compare the result with K < ; ya 
6. Let n be a positive integer. Use mathematical induction to prove this 
statement: For each integer r such that 0 = r = n, (") is an integer [Hint: For 
n = 1itis easy to calculate (3) =1l= G) assume the statement is true for 
n = k and use Exercise 5 to show that the statement is true for n = k + 1.] 


7. Here are the first five rows of Pascal’s triangle: 


Row 0: 1 
Row l: ft it 
Row 2: I 2 a 


Row 3: Ls B f 
Row4: 1 4 64 1 


Note that each entry in a given row (except the l’s on the end) is the sum of the 
two numbers above it in the preceding row. For instance, the first 4 in row 4 is 
the sum of 1 and 3 in row 3; similarly, 6 in row 4 is the sum of the two 3’s in 
row 3. 

(a) Write out the next three rows of Pascal’s triangle. 

(b) Prove that the entries in row n of Pascal’s triangle are precisely the 


coefficients in the expansion of (a + 5)", that is, a al i“ one a 
[Hint: Exercise 5 may be helpful] 4 
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APPENDIX F 


Matrix Algebra 


This appendix may be read at any time after Section 3.1 but is needed only in Chapter 16. 
Throughout this appendix, R is a ring with identity. 

Rings of 2 X 2 matrices with entries in Z, Q, R, and C were introduced in Section 3.1. 
These matrices are special cases of this definition: An 2 X m matrix over R is an array 
of n horizontal rows and m vertical columns 


Mm m m ç > Tis 
m m m °°" Tam 
fy Ty h3 ~? Tom 
m m mw e Fam 


with each r,€ R. For example, 


7 -6 4 10 0 
1 4 0 
1 0 5 =2 1 1 O 10 
a=j3 sa m s| jè 23) e-( 7 5 4) 
0 52 0 -8 
4X SoverZ 3 X 3 over Zs 2 X 4 over Z, 


Matrices are usually denoted by capital letters and their entries by lowercase 
letters with double subscripts indicating the row and column the entry appears in. For 
instance, in the matrix A = (ay) above, the entry in row 4 and column 2 is ay = 5. In 
matrix C, c2 = 0 and cz = 1. Thus, for example, row i of an n X m matrix (ry) is 


fa To fB ra'i 


The n X m zero matrix is then X m matrix with 0, in every entry. The identity matrix J, 
is the » X n matrix with 1, in positions 1-1, 2-2, 3-3, . . . , n-n, and 0, in all other posi- 
tions. For example, over the ring R, 
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Er 10000 
1 0 0 ne A. Sh 01000 
L=|0 1 0 wS T e Fana A eb 
0 0 1 00010 
0 1 

o D 00001 


The identity matrix J, can be succinctly described by Z, = (ôy), where 5yis the Kronecker 
delta symbol, defined by 
by = { lif i=j. 


Orifi +j 
It is sometimes convenient to think of a large matrix as being made up of two 
smaller ones. For example, if A is the 3 X 2 matrix 


4 2 
1 0 
3 3 
over Z, then (A | 4) denotes the 3 X 5 matrix 
L i 0 & @ 
0101 0 
o 0 1 3 6 
2 3 
' A 4 6 2 3 
Simi A : 4 4 
imilarly, ( *) denotes the matrix 1 oF where A ( 4 > 
0 1 


If A = (a) and B = (by) are n X m matrices, then their matrix sum A + B is 
the z X m matrix with ay + by in position ij. In other words, just add the entries in 
corresponding positions, as in this example over 7: 


E 3 deti 2 J = (3 0 J 
o A d i“ wy Wo da af 
If A and B are of different sizes, their sum is not defined. But if A, B, C aren X m 
matrices, then Exercise 3 shows that matrix addition is commutative[A + B = B+ Aland 
associative [A + (B + C) =(A + B) + C]. Then X m zero matrix acts as an identity 
for addition (Exercise 4). 

For reasons that are made clear in a linear algebra course, the product of matrices 
A and B is defined only when the number of columns of A is the same as the number of 
rows of B. The simplest case is the product of a 1 X m matrix A consisting of a single 
bi 
ba |» 


TOW (@; @, 4," * * Am) and an m X 1 matrix B consisting of a single column 


by 


*A matrix with only one row is called a row vector and a matrix with only one column a column 
vector. Single subscripts are adequate to describe the entries of row and column vectors. 
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The product is defined to be the 1 x 1 matrix whose single entry is the element 


ab, + ab, + ab; + agba + +++ + AyD 
For example, over Z 
4 
(*) (2 3 1)|0|=2:4+3:0+1:2=10. 
2 
If A is an n X m matrix and B is an m X k matrix, then the matrix product AB is 


the n X k matrix (¢,), where the entry in position i-/ is the product of the ith row of A 
and the jth column of B: 


M 
Cy = anby + Agby + anby + ayby +t + ambay = Darby 


EXAMPLE 1 


The product of 


d 2 6 3 
2 5 H 
a(i E a and B=]0 1 2 1 

o 6 O 2 


isa 2 X 4 matrix whose entry in position 1-1 is 10 (the product of row 1 of A and 
column 1 of Bas shown in (+) above). In position 2-3 the entry in AB is the product of 
row 2 of A and column 3 of B: 


1-6+5-2+0-0= 16. 


Similar calculations show that 
m- 3 nfo aoe {£ 13 18 a 
I 5 2602 4 7 16 8 


The product BA is not defined because B has four columns, but 4 has only two 
rows. 


If A, B, C are matrices of appropriate sizes so that each of the products 4B and 
BC is defined, then matrix multiplication is associative: A(BC) = (AB)C (Exercise 7). 
Similarly, if E, F, G are matrices such that the products EG and FG are defined, then the 
distributive law holds: (E + F)G = EG + FG (Exercise 5). The identity matrices act as 
identity elements for multiplication in this sense: If A is an n X m matrix, then Å * A = A 
and A : J, = A (Exercise 6). Even when both products 4B and BA are defined, matrix 
multiplication may not be commutative (see Example 6 in Section 3.1). 

Let M,(R) denote the set of all n X n matrices over the ring R. Since all the matri- 
ces in M,(R) have the same number of columns and rows both A + Band AB and BA 
are defined for all 4, Be M,(R). The properties of matrix addition and multiplication 
listed above provide the proof of 
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Theorem F1 


If Ris a ring with identity, then the set M,(R) of all n x n matrices over Ris a 
noncommutative ring with identity /,. 


@ Exercises 


NOTE: Unless stated otherwise, ail matrices are over a ring R with identity. 
A. 1. Assume A and Bare matrices over Z. Find A+ B. 


ea oe, ee 
wah 5 7 My =h 0 4 ') 


3 0 2 te 2 0 
4 1 6 3 0 4 
b) A= B= 
(0) 0 1 oO 0 7 —6 
D Sg 1 6 0 
2. Assume A and B are matrices over Z6. Find AB and BA whenever the 
products are defined. 
2 4 
LO T 
(a) A=]1 5S] B= E 3 2) 
3 0 


©@A=(3 2 1 0) B= 


f4 0 5 3 
wash; 2) 25G a 2) 
0 

1 

0 

1 


= O O = 
= = O © 


B. 3. Let A = (a,), B = (6y), and C = (cy) ben X m matrices. Prove that 
(a) A+ B=B+A t) A+(B+ C)=(A+B)4+C 
4. If A = (ay) is an n X m matrix and Z is the n xX m zero matrix, prove that 
A+Z=A. 
5. (a) Let E and Fbe 1 x m row vectors and G = (g,,) an m X k matrix. Prove 
that (E + F)G = EG + FG. 


(b) Let E = (ey) and F = (fy) ben X m matrices and G = (g,,) an m X k 
matrix. Prove that (E + F)G = EG + FG. 
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6. If Ais an» X m matrix, prove that J,- A = A and A- Ip = A. 
C. 7. Let A = (ay) be an n x m matrix, B = (by) an m x k matrix, and C = (cy) a 
k x p matrix. Prove that A(BC) = (AB)C. [Hint: BC = (dy), where 
k m 


dj = D brch and AB = (ex), where ey = F anbe- The i-j entry of A(BC) is 
=1 


m a m k m kè tal 
Dad; = Sal bec) = > D tibici Show that the ż-j entry of (4B)C is 
t=1 t=] r= t=lr= 


this same double sum.] 
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APPENDIX C 


Polynomials 


In high school there is some ambiguity about the “x” in polynomials. Sometimes x 
stands for a specific number (as in the equation 5x — 6 = 17). Other times x doesn’t 
seem to stand for any number—it’s just a symbol that is algebraically manipulated 
(as in exercises such as (x + 3)(x — 5) = x*— 2x — 15).* Our goal here is to develop a 
rigorous definition of “polynomial” that removes this ambiguity. The prerequisites for 
this discussion are high-school algebra and Chapter 3. 

As a prelude to the formal development, note that the polynomials from high 
school can be described without ever mentioning x. For instance, 5 + 6x — 2x3 is com- 
pletely determined by its coefficients (5, 6, 0,—2).? But 5 + 6x — 233 can also be written 
5 + 6x — 2x3 + 0x4 + 0x5 + 0x*. To allow for such additional “zero terms”, we list the 
coefficients as an infinite sequence (5, 6, 0, —2, 0, 0, 0, 0, . . .) that ends in zeros. 

Adding polynomials in this new notation is pretty much the same as before: Add 
the coefficients of corresponding powers of x, that is, add sequences coordinatewise: 


5+ 6& -2 (5, 6,0, —2,0,0,0,...) 
3 — 2x + 5x7 — 4x3 (3, —2, 5, —4, 0,0, 0, . . .) 
8 + 4x + 5x? — 6x3 (8, 4,5, —6,0,0,0,...). 


Multiplication can also be described in terms of sequences, as we shall see. If you keep 
this model in mind, you will see clearly where the formal definitions and theorems 
come from. 

Except in Theorem 4.1 at the end of this appendix, R is a ring with identity (not 
necessarily commutative). A polynomial with coefficients in the ring R is defined to be 
an infinite sequence 

(ao; Ay, 22, a3, .  .) 
such that each aE R and only finitely many of the a, are nonzero; that is, for some 
index k, a, = Og for all i > k. The elements aE R are called the coefficients of the 
polynomial. 


*Sometimes x is also used as a variable that can take infinitely many values (as in the function 
f(x) = x? — x). This usage is discussed in Section 4.4. 
10 is the coefficient of x’. 
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The polynomials (a, 44, a, . . .) and (bo, bi, b2, . . .) are equal if they are equal as 
sequences, that is, if ay = bg, a, = 5, and in general, a, = b; for every i = 0. Addition of 
polynomials is denoted by @ and defined by the rule 


(a; 2, 22, . . .) ® (bo; b1, ba, . P) = (a + bos a, + by, a + ba, . J -a +5, . P 
You should verify that the sequence on the right is actually a polynomial, that is, that 


after some point all its coordinates are zero (Exercise 2). 
Multiplication of polynomials is denoted © and defined by the rule* 


(ao, G1, a2, - . .) © (bo, brs ba, . . . .) = (Co, Cis C2, - . ), where 
& = aobo 
Cy = ab, + abo 
C2 = agb, + aib, + abo 


Cn = Ady + abai + abaa + abpa +++ + + aibi + abo 


A 
=F ab,- e 
t=0 


To show that the product defined here is actually a polynomial you must verify that 
after some point all the coordinates of (co, ĉ1, . . .) are zero (Exercise 2). 


Theorem 6.1 


Let R be a ring with identity and P the set of polynomials with coefficients in 
R. Then £ is a ring with identity. tf R is commutative, then so is P. 


Proof > Exercise 2 shows that P is closed under addition and multiplication. To 
show that addition in P is commutative, we note that a, + b; = b; + a; 
for all a, b,E R because R is a ring; therefore, in P 


(os &, a...) ® (bo, bi, bn.. ) 
= (dy + bo, a, + by, . . .) = (bo + to, by + a...) 
= (bo, by, ba - ..) D (@, a, Mp.. J 


Associativity of addition and the distributive laws are proved similarly. You 
can readily check that the multiplicative identity in P is the polynomial 
(Ir Op, Or Op, - - .), the zero element is the polynomial (Op, Og, Op, « - -); 
and the solution of the equation (dp, 41, 2, .. .) + X = (Og, Og, Og, . . .) is 
X = (~a ~an, —@..). 

To complete the proof that P is a ring with identity, we must show 
that multiplication is associative. Let A, B, C € P, where 


A = (dp diay...) B= (by bis bas- .) C = (Cos Cis Cz « » »)e 


“To understand the formal definition, do the following multiplication problem and look at the 
coefficients of each power of x in the answer: (a) + a,x + agx")(by + bx + bax’). 
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Then the nth coordinate of (A © B) © Cis 


O Sea 3 [Sab Jes $ Sahe 


Exercise 6 shows that the last sum on the right is the same as 


(+*+) >, Ob yCup 


where the sum is taken over all integers u, v, w such that u + v + w = n 
and u = 0, v = 0, w = 0. On the other hand, the nth coordinate of 
AO(BO Ois 

h a-r 


e) Sabo = En Shana = $ Sabers 


Exercise 6 shows that the last sum on the right is also equal to (++). Since 
the nth coordinates of (A © B) © Cand A © (BO C) are equal for each 
n2=0,(A OB)O C= 40 (BO ©). The proof of the final statement of 
the theorem is left to the reader (Exercise 3). E 


In the old notation, constant polynomials behave like ordinary numbers. In the 
new notation, constant polynomials are of the form (r, 0, 0, 0,... .), and essentially 
the same thing is true: 


Theorem 6.2 


Let P be the ring of polynomials with coefficients in the ring R. Let A* be the 
set of all polynomials in P of the form (r, Og, Op, Og, . . .), with re R. Then R* is 
a subring of P and is isomorphic to R. 


Proof > Consider the function f:R— R* given by 


Se) = (r, Og, Op, Og, « - -)- 

You can readily verify that fis bijective. Furthermore, 

f(r +5) =(r + 5, Op, Op, Op, . - ») 

= (r, Op, Op, Op, - - -) © (S, Or Op, On) = fC) + fO) 

and 

S (rs) = (rs, Or, Op; On, - - .) 

= (r, Op, Op, Or, » - -) O (5, Op, Op, On, - --) = Ar) OS(s). 
Therefore, f is an isomorphism, and, hence, R* is a subring. m 
Now that the basic facts have been established, it’s time to recover the “old” nota- 


tion for polynomials. First, we want polynomials in R* to look more like “constants” 
(elements of R), so 


(a, Op, Op, Op, . . .) will be denoted by the boldface letter a. 


Curricty 2012 Coenen EET Al Bight Gnerend May oct bs copia ext dugir ant. in whale or ia pact. Des to eisctronic dghts, sume third paty codina may be appi fixan the eBook andor oChagtar(s). Edianrial review has 
that noy appi odaat does oct exaieially affect the ovanil laning axperiama Cangege Leeming maarves tha right to recave eiidiional coding? mt asy tine if subanqummt sights cestricicms require it. 


548 Appendix G Polynomials 


Next, reverting to the original source of our sequence notation, 
(Op, lp, Op Op, Or oe ) will be denoted by x. 


There is no ambiguity about what x is here—it is a specific sequence in P; it is not an 
element of R or R*, and it does not “stand for” any element of Ror R*. 
This notation makes things look a bit more familiar. For instance, 


(a, Op, Op, Or, -- .) + (b, Or, Or, . . (Or, Lr, On, OR, - - -) 


becomes a + bx. Similarly, we would expect ex? (the “constant” c times x°) to be the 
sequence (Op, Op, Og, €, Op, Op, . . .) With c in position 3.* But we can’t just assume that 
everything works as it did in the old notation. The required proof is given in the next 
two results. 


Lemma 6.3 


Let P be the ring of polynomials with coefficients in the ring R and x the 
polynomial (Op, 1r, Op, Or, . . .)» Then for each element a = (@, Og, Og, . . .) of 
R* and each integer n = 1: 


(1) x° = (Og, Op, » + +, Op: 1p, Op, » . »), Where 1p is in position n. 
(2) ax” = (Op, Op, « + u Op, 8, Og, .. .), Where a is in position n. 
Proof > The polynomial x can be described like this: 
X = (2, G1, €z «+ .); where e, = Op for all i # 1, and e, = lp. 


Statement (1) will be proved by induction on 2. It is true for n = 1 by 
the definition of x! = x. Suppose that it is true for n = k, that is, suppose 
that 


x* = (d, dis d» . ..), where d, = Op for i # k, and d, = 1p. 
Then 
Atl = ke = (do, di, do, . © J(€0, 15 €z, - - .) = (Fo, Fis Fos «< Ds 
where for each j = 0, 
J 
= Dde- 
i= 
Since e, = Op for i # 1 and d, = Op fori + k, we have 


Tk+1 = doeg+1 to o> + dy yen + dei + dyiyea = de; = Lple = 1p 
0 6 


*Remember that in the polynomial (r, s, z, . . .) the element ris in position 0, s is in position 1, ¢is in 
position 2, etc. 
tSee Appendix C. 
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and, forj#k+ 1, 
= + dei +- + d8 + di1, + de 
ty Selig Son Gh a Gah il, 
0 0 
= d1 == 4_y1p = dya. 


But j — 1 + k since J + k + 1. Therefore, r = dı = Ogforallj#k +1. 
Hence, x**+! = (ro, r1, fa, -..) = (Op, Ors - . +5 Op, 1r, Or, . . .), With 1gin 
position k + 1. So (1) is true for n = k + 1 anid, therefore, true for all n 
by induction. 

A similar inductive argument proves (2); see Exercise 7. Ml 


Theorem 6.4 


Let P be the ring of polynomials with coefficients in the ring R. Then P 
contains an isomorphic copy R* of R and an element x such that 
(1) ax = xa for every a e R*. 
(2) Every element of P can be written in the form ag + ax + ax? + 
-+ an’. 
(3) Ifa + ax +++ + + ax = bo + bix +e + bax” with n = m, then 
= b, for į =s n and b, = 0g for/> n; in particular, 
(4) ao + ax + aax? + +++ + anx” = Og if and only if a, = Og for every i = 0. 


Proof» Let x be as in Lemma G.3. The proof of (1) is left to the reader (Exercise 5). 


(2) If (ao, a, 22, . . .) E P, then there is an index n such that a, = Og for 
all i> n. By Lemma G.3 


(Gq, h, Ay, «+ 5 By Op, Op, . - -) 
= (A, Op, Op, .- .) + (Or, a, Or, -- -) + (Op, Op, %; Op,» « -) 
+: + (Op... 5 OR, ans Op, ---) 
= dy + ax + aye +++ ++ ap. 
(3) Reversing the argument in (2) shows that ay + ajx +: +++ a,x" 
is the sequence (do, @), 22, . » - Ans Op, Op, . - .) and that by + bx + +++ + 


b,,X™ = (bo, bi, b2, . « . » bms Op, Or, - - -). If these two sequences are equal, 
then we must have a, = b, for isn and Og = b; forn<ism. 


(4) is a special case of (3): Just let b; = Og. E 


When polynomials are written in the form ay + a,x + + ++ + a,x", addition and 
multiplication look as they did in high school, except for the use of boldface print in 


certain symbols. 
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EXAMPLE 1 


In the ring of polynomials with real-number coefficients, the distributive laws 
and Theorems G.2 and G.4 show that 


(3x + 1)(2x + 5) = Gx + 1)2x + Gx + 135 
= 3x2x + 1:2x+3x5+1°5 
=3+2xx+1-2x+3°5x4+1°5 
= 62+ 17x +5. 


In terms of elements, the distinction between boldface and regular print is 
important because a is a sequence, while a is an element of R. But in terms of algebraic 
structure, thereis no need for distinction because R* (consisting of all the boldface a’s) 
is isomorphic to R (consisting of all the a’s). Consequently, there is no harm in identi- 
fying R with its isomorphic copy R* and writing the elements of R = R* in ordinary 
print.* Then polynomials look and behave as they did before. For this reason, the 
standard notation for the polynomial ring is R[x], which we shall use hereafter instead 
of P. 

We have now come full circle in terms of notation, with the added benefits of 
a rigorous justification of our past work with polynomials, a generalization of these 
concepts to rings, and a new viewpoint on polynomials. Beginning with a ring R with 
identity we have constructed an extension ring R[x] of R (that is, a ring in which R is a 
subring). This extension ring contains an element x that commutes with every element 
of R. The element x is not in R and does not stand for an element of R. Every element 
of the extension ring can be written in an essentially unique way in terms of elements of 
R and powers of x. Because x has the property that aj + ax +--+ + ap = Opif and 
only if every a, = Og, x is said to be transcendental over R or an indeterminate over R.' 

We are now in position to prove Theorem 4.1, in which the ring R need not have 
an identity. 


Theorem 4,1 


If R is a ring, then there exists a ring 7 containing an element x that is not in 
R and has these properties: 


(i) R is a subring of T. 
(ii) xa = ax for every a ER. 


*You've been making this identification tor years when, for example, you treat the constant 
polynomial 4 as if it were the real number 4. The identification question can be avoided by 
rewriting the definition of polynomial to say that a polynomial is either an element of R or a sequence 
(a, &, ...) with at least one a; + Og for i = 1 and all a; eventually zero. Then the polynomials actually 
contain R as a subset. The definitions of addition and multiplication, as well as the proofs of the 
theorems, then have to deal with several cases. Proceed in the obvious (but tiring) way until you 
have provedTheorem G.4 again. 

tThe latter terminology is a bit misleading since x is a well-detined element of R[x]. 
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(iii) The set Rix] of ali elements of 7 of the form 
ay tax tax? +++: +a x"  (wheren =0andaeR) 
is a subring of 7 that contains A. 
(iv) The representation of elements of R[x] is unique: If n = m and 
A + A,X + ayn? + 6+ + + ap = by + bix + Dox? + + >= + by”, 


then a, = b, fori = 1, 2,...,n and b, = Op for each i> n. 
(V) a + ax + ax? ++ ++ + ax" = Op if and only if a, = Op for every i. 


Proof > There are two cases: (1) R has an identity; and (2) R does not have an 
identity. 
Case 1: Use Theorems G.| and G.4, with T = P = R[x] and R* identified 
with R. 


Case 2: Let S be a ring with identity that contains R as a subring. With 
many familiar rings, an S is easy to find. For example, ring of even inte- 
gers has no identity, but is a subring of Z, which does have an identity. 
For the general case, use Exercise 39 of Section 3.3. 

Apply Case 1 with S in place of R, to construct S[x] = T. The poly- 
nomials in S[x] whose coefficients are actually in R form a subring of 
S[x] = T that contains R, as you can readily verify (Exercise 10); this 
subring is R[x]. Hence, property (i) of the theorem is satisfied. Since 
properties (iiv) hold for all elements of S[x], they necessarily hold for 
all elements of R[x]. m 


Finally, note that 
When R does not have an identity, the polynomial x is #o7 itself in R[x]. 


For instance, the ring of polynomials over the ring R of even integers consists of all 
polynomials with even coefficients. So it does not contain x = lx or any polynomial 
kx with k odd. 


E Exercises 


A. 1. Express each polynomial as a sequence and express each sequence as a 
polynomial. 


(a) (0, 1, 0,1, 0, 1, 0,0, 0,...) 

(b) (0, 1, 2, 3, 4, 5, 6, 6, 8, 9, 0, 0,0,...) 
(© 3x6 — 5x + 12x3 — 3x? + 7.5x — 11 
(d) (« — DN- a+ 


2. (a) If (a), az, . . .) and (5), 5, . . .) are polynomials, show that their sum is a 
polynomial (that is, after some point all coordinates of the sum are zero). 
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(b) Show that (a,, a), . . .) © (b), ba, . . .) is a polynomial. [Hint: If a, = Og for 
i> k and b, = 0g for i > t, examine the ith coordinate of the product for 
i>k+t] 


. Prove these parts of Theorem G.1: 


(a) addition in P is associative; 
(b) both distributive laws hold in P; 


(c) P is commutative if R is. 


. Complete the proof of Theorem G.2 by proving that 


(a) fis injective; (b) fis surjective 


5. Prove (1) in Theorem G4. 


6. (a) In the proof of Theorem G.1 (associative multiplication in P) show that 
no? 


D 5 abijin-i = > a,b,¢,, where the last sum is taken over all 

=0/=0 
nonnegative integers u, v, w such that u + v + w = n. [Hint: Compare the two 
sums term by term; the sum of the subscripts of a,_;¢,_; 18 7; to show that 
@,,5,¢y is in the other sum, let j = u and i = u + v and venfy that n — i = w.] 


LAE, ld 


(b) Show that D Diadben- y= Dabu [last sum as in part (a)]. 


r=07=0 


. Prove (2) in Lemma G.3. [Hint: a = (dg, 41, a2, .. .), where a, = Opg for i > 1, and 


by (1), x" = (d, di, dz, .. .), where dp, = 1p and d; = Og for i # n; use induction 
on R.] 


. Let R be an integral domain. Using sequence notation, prove that the 


polynomial ring R[x] is also an integral domain. 


. Let R be a field. Using sequence notation, prove that the polynomial ring R[x] 


is not a field. [Hint: Is (Og, 1p, Op, Og, . . .) a unit? 


. In the proof of Case (2) of Theorem 4.1, show that R[x] is a subring of S[x] 


that contains R. 


. (a) Let Q[r] be the set of all real numbers of the form ro + rim + rom + 


+++ + r,t", where n = 0 and each r E Q. Show that Q[7] is a subring 
of R. 


(b) Assume that ro + rim + + ++ + rym" = Oif and only if each r, = 0. (This 
fact was first proved in 1882; the proof is beyond the scope of this book.) 
Prove that Q[7] is isomorphic to the polynomial ring Q[x]. 
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ANSWERS AND SUGGESTIONS 
FOR SELECTED 0DD- 
NUMBERED EXERCISES 


For exercises that ask for proofs, there may be a sketch of the full proof (you fill in minor 
details), a key part of the proof (you fill in the rest), or a comment that should enable you to 
find a proof. 


Chapter | 


Section 1.1 (page a) 


1. (a) q=4%r=1 (b) g=0;r=0 (ce) q= -S5;r=3 

3. (a) g=6;r=19 (b) q = —9; r = 54 {c) q = 62,720; r = 92 

5. Multiply the equation and the inequality by c. Apply the Division Algorithm 
appropriately. 


7. If a= 3q + 1, then a = (3g + I)? = 9’ + 6g + 1 = 3(3q? + 2q) + 1, which is 
of the form 3k + 1 with k = 3g? + 2g. Use similar arguments when a = 3q or 
a=3q+2. 


9. By the Division Algorithm, every integer a is of the form 3q or 3g + 1 or 3q + 2. 
Compute a’ in each case and proceed as in Exercise 7. 


Section 1.2 (page 14) 

L (a) 8 ()1 (9 (g) 592. 

3. a|b means b = au for some integer u. Similarly, b |c means c = bv for some 
integer v. Combine these two equations to show that c = a - (something), which 
proves that a|c. 

5. a|b means b = au for some integer u, and b |a means a = bv for some integer v. 
Combine the equations to show that a = euv, which implies that 1 = uv. Since u 
and v are integers, what are the only possibilities? 


7. |a|—Why? 


9. Advice: Before trying to prove a simple statement, check to see if there are any 
obvious counterexamples. 


11. (a) lor2 


13. (c) By parts (a) and (b), the set of commondivisors of a and b is identical to the 
set of common divisors of b and r. What is the largest integer in this set? 


656 
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19. Supposed |a and d|b, so that a = du and b = dv. Since a|(6 + c), b + c = aw. 
Hence, ¢ = aw — b = duw — dv = duw — v), so that d| c. Since (b, e) = 1, what 
can you conclude about d and (a, b)? 


21. Every common divisor of a and (b, c) is also a common divisor of (a, 6) and c. 
[Proof: If d|(6, c), then d|b and d|c by the definition of (b, c). If d|a also, then d 
is a common divisor of a and b, and, hence, d] (a, b) by Corollary 1.3.] A similar 
argument shows that the common divisors of (a, b) and ¢ are also common 
divisors of a and (b, c). 


25. (a) (a, b) divides both a and b by definition. What does this say about (a, b) and 1? 


27. d= cu + av for some u, v (Why?). Hence, db = cbu + abv. Use the fact that 
ab = cw for some w (Why?) to show that c | db. 


29. First show that every integer # is the sum of a multiple of 9 and the sum of its digits. 
[Example: 7842 = 7 - 1000 + 8 - 100 + 4 - 10+ 2 = 7(999 + 1) + 8999 + 1) + 
49 +1)4+2=(7°999 + 8-994 4-9)4+(74+84+442)=M47°1114+8:11 +4 
+(7+8 + 4+ 2).] Thus, every n is of the form 9k + r, where r is the sum of the 
digits of n. Hence, nis divisible by 9 if and only if 9 divides r. 


31, (a) 30; 60; 420; 72 


33. Let d = (a, b). Then a = du and b = dv for some integers u and v. Let m = ab/d. 
Show that m is a common multiple of a and b. If c is any other common multiple 
of a and b, use Exercise 26 to show that m = c. What does this tell you? 


Section 1.3 (page 22) 
1. (a) 5040 = 2% -3-5.7 (c) 45,670 = 2+5- 4567 
3. All of them. 


5. (a) 3, 37,3%,..., 353+ 5, 32+ 5, 39+5,..., 3+ 553+ 5% 32+ 5%, 39-54, 
BF 523-532 35 St 326 5 3-5 3 585, 5% SE 

7. Because p divides a, there is an integer k such that a = pk. Similarly, a + bc = pd 
for some integer d. Hence bc = pd — a = pd — pk = p(d — k). Apply Theorem 1.5. 


9. (+=) Suppose p has the given property and let d be a divisor of p, say p = dt. By 
the property, d = +1 (in which case t = +p) or t = +] (in which case d = +p). 
Thus the only divisors of p are +1 and +p, and pis prime. 


11. a — b = pv and c — d = pw for some v, w (Why?). Add the two equations and 
rewrite each side of the sum equation to obtain the fact that p divides 
(a+ c) — (b +d). 

17. Every prime divisor of a’ is also a divisor of a by Theorem 1.5, and similarly for 5”. 


b = PE aoe PE E 
a pip 
the p; are distinct primes, each of the exponents on the right side of the preceding 
equation must be nonnegative (Why?)—that is, sı — rı = 0, s2 — r2 = 0, ..., 

Sk — r2 0. 


19. py +: pÈ ™. Since a|b, we know that 2 is an integer. Since 


21. If c has prime decomposition pp, * * * Py then ab = £ = pypypp, * * * PyPy- Now Py 
must divide a or b by Theorem 1.5, say a. Since (a, b) = 1, p, cannot divide b. Hence, 
(p;)"|a. By relabeling and reindexing if necessary, show that a = pypyprp2 ‘+ * Pypy = 
(PaP2* ** Py? and b = Py Pps ** *PaPe = Op Pen" ** Pa 
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23. Suppose a and b are positive and a”|6*. Suppose that a = pitp}- - - pz and 
b = pip? «+ pk, where p;, Pa, - - - , pare distinct positive primes with each 
Ta Sı = 0 (see Exercise 13). Then & = p?^p?* -- - p?* and b? = p}tp? -- «p?* and 
because q’ |b? we have 2r, = 2s, and hence r, = s, for each i= 1, 2,..., k by 
Exercise 19. Thus, there are nonnegative integers u),..., u, such that s = 7; + u 
for each i. Use this fact and the prime decompositions of a and b to show that 
a|b. The converse is easy. 


25. Exercise 6 in Appendix E shows that V4 is an integer. (*) = p,and for k > 1, the 


denominator of (7) is the product of integers that are each strictly less than p. 


27. If p > 3 is prime, then p = 6k + | or 6k + 5 (Why can the other cases be 
eliminated’). If p = 6k + 1, then p? + 2 = (6k + 1} + 2 = 36k + 12k + 3 = 
3(12K + 4k + 1). The other case is handled similarly. 

29. Let k be the highest power of 2 that divides n. Then n = 2*m for some integer m, 
which must be odd because otherwise 2**! would divide n, contradicting the 


fact that k is the highest power of 2 that divides n. Uniqueness follows from the 
Fundamental Theorem of Arithmetic. 


33. Verify that 7 — 1 = (x — IQ + x2 +--+ +2 +x + 1). Conclude that 
y™ — 1 = (f — | has y” — las a factor. Apply this fact with y = 2 and p = mn 
to show that 2” — 1 is composite whenever p is. 


Chapter 2 


Section 2.1 (page 30) 

1. (a) 2*= 16 = 1 (mod 5) 

3. {a} and (c) 

5. (a) 5= 1 (mod 4), so 5%” = pom = | (mod 4) by Theorem 2.2. Apply Theorem 2.3. 
(b) First, find a negative number that’s congruent to 4 (mod 5). 


7. By Corollary 2.5, a = 0 ora = l or a = 2 or a = 3 (mod 4). Hence, a’ is 
congruent to 0? or |? or 2? or 3? (mod 4) by Theorem 2.2. 

9. (a) (n — af = — 2na + @. Hence, (n — a’ — a’ is divisible by n. 

13. (=) By the Division Algorithm, a = gn + r and b = pn + s with the remaindersr 
and s satisfyingO Sr < nand 0 = s < n. If a = b (mod n), then a — b = kn (Why?), 
and, hence, kn = (qn + r) — (pn + s), which implies that r — s = (k — q + p)n, that 
is, n|(r — s). Since r and s are strictly less than x, this is impossible unless r — s = 0. 
To prove the converse, assumer = s and show that n| (a — b). 

15. Use Theorem 1.2 and the definition of congruence. 

17. Note that 10 = —1 (mod 11) and use Theorem 2.2. 


19. a — b = nk for some k (Why?). Show that any common divisor of a and n also 
divides b, and that any common divisor of b and » also divides a. What does this 
say about (a, n) and (b, n)? 


21. 10 = | (mod 9); hence 10' = 1" = 1 (mod 9) by Theorem 2.2. 
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Section 2.2 (page 36) 

a + | g - |o om 
w fo m oO loa A 
u 1m 0 m lo 1 


©+ | 0O 1 2 B A 


HW) a BD A S A A 
2/2) 8 A S & A 
B| B 4 5 & A N B 
4 A S & © G a B 
[5] | 5] © © G a B A 
[6 | 4 © 1 A B A eS 


5} 5 8 © A a 
| O 6 6) 4 8 a 


3. x = [1], (3) [5} or [7] 

§. x = [I], [2], [4], or [5] 

7. x = [3] or [7] 

9. {a} [a] = [3] or [5] (c} No 

11. (a) x= [0], [1} or [2] (©) x = [0} (1, (21, [3} or [4] 
13. Look in Z4 or Ze- 

15. (a) [aP + [bP (c) [aF + bF 


Section 2.3 (page41) 
1. (a) a= 1, 2,3, 4, 5, and 6 {c) a = 1, 2, 4, 5, 7, and 8. 
3. Several possibilities, including Exercise 10. 


§. Since b is a zero divisor, be = 0 with b+ 0 and ¢ #0. Hence, (ab)c = 0. Use the 
fact that a is a unit to show that ab + 0. What do you conclude? 


7. ab =OinZ, means p|ab in Z. Apply Theorem 1.5 and translate the result into Z,. 


9. {a} Sinceaisa unit, ab = 1 for some b. If a were also a zero divisor, then we would 
have ac = 0 for some c # 0. Consider the product abc and reach a contradiction. 
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11. Existence of a solution: au = | for some u (Why?). Multiply both sides of 
ax = b by u. Uniqueness: Assume that r and s are solutions of ax = b and use 
the fact that a is a unit to show that r = s. 


15. (a) 3, 9, 15. 


17. If a and care units, then ab = 1 and cd = 1 for some b, d. Use this to show that 
ac is a unit. 


Chapter 3 


Section 3.1 (page 53) 
1. (a) Closure for addition. 
5. (a) Subring without identity (every product is the zero matrix) (c) Not a subring 
{e} Commutative subring with identity. 


7. Axioms 1—S5 are easy to verify. Is K closed under multiplication? 


11. (a) Partial proof: Closure under addition holds since + na + i F = 


( Ta Wer ‘) ES. The zero matrix is in S. Use Theorem 3.2. 
btd b+d 


{c} J fails to be a left identity for any B € S whose bottom row is nonzero— 
check it out. 
13. Use Theorem 3.2. Closure under addition: (a + 6V2) + (c + dv2) =. 


(ate) + (6+ 4)V2e2 (V2) sincea + cEZ and b + deZ. Closure under 
multiplication: See Example 20. Also, 0 = 0 + 0V2EZ (v2). You do the rest. 


15. (a) + | (0,0) (1,1) (0,2) (1,0) (0,1) (1,2) 


(0,0) | (0,0) (1,1) (0,2) (1,0) (0,1) (1,2) 
(1,1) | (1,1) (0,2) (1,0) (0,1) (1,2) (0,0) 
(0,2) | (0,2) (1,0) (0,1) (1,2) (0,0) (1,1) 
(1,0) | (1,0) (0,1) (1,2) (0,0) (1,1) (0,2) 
(0,1) | (0,1) (1,2) (0,0) (1,1) (0,2) (1,0) 
(1,2) | (1,2) (0,0) (1,1) (0,2) (1,0) (0,1) 


* |09 dj) @2) (1,0 0,1) (1,3 


(0,0) | (0,0) (0,0) (0,0) (0,0) (0,0) (0,0) 
(1,1) | 0,0) (1,1) (0,2) (1,0) (0,1) (1,3) 
(0,2) | (0,0) (0,2) (0,1) (0,0) (0,2) (0,1) 
(1,0) | (0,0) (1,0) (0,0) (1,0) (0,0) (1,0) 
(0,1) | 0,0) 0,1) (0,2) @,0) (0,1) (0,2) 
(1,2) | (0,0) (1,2) (0,1) (1,0) (0,2) (1,1) 
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4 [10 eS A BB C D E EB 
0;0 S A B CC D E F 
S/S 0 F & D © B A 
Al:A R © 1D #& BEG SS 
BIB E Dh FAS EG 
EIE D E F O S$ A B 
DAD C B A S ~ P E 
EVE B €C S A F OO D 
EJF A S € BREEDD 

O S A B CDE F 
0/0 0 0 0O 0 @ 0 0 
S~ & A 2 G6 B E F 
AJO A A 0 0O A A 0 
BO sB 0 B O B O B 
CIO € O © C 0U € 6 
DIO D A RBR ODAR 
E |0 E A O C A E G 
ih sO F O B te B C E 


21. The multiplicative identity is 6. 


23. To prove that E is closed under +, you must verify that when a and b are even 
integers, sois a * b = ab/2. To prove that * is associative, verify that(a +b) *c= 
a»(b*c)as follows By definition, (a * b) * c = (ab/2) *c = = Express 
a + (b * c) in terms of multiplication in Z and verify that the two expressions are 
equal. Commutativity of + is proved similarly. To prove the distributive law, you 
must verify that a * (b + c) = a * b + a * c, that is, that a(b + c)/2 = ab/2 + 
ac/2. If there is a multiplicative identity e, then it must satisfy e + a = a for every 
a E E, which is equivalent to aa/2 = a in Z. But ea/2 = a implies that e = 2. 


25. Partial proof: Axiom 4: The zero element is —1 because r ® (—1) = r + (—1) + 
1 =r. Axiom 5: Since —] is the zero element, we must show that the equation 
a ® x = —1 has a solution. The solution is x = —2 — a because a ® (—2 — a) = 
a + (—2 — a) + 1 = —1. To prove that this ring is an integral domain, you must 
assume that a © b = —1 and show that a = —1 orb = —1. Nowa Ob = —1 
means that ab + a+ b = —1 in Q, that is, that ab + a + b + 1 = 0. Factor the 
left side and use the fact that Q is an integral domain. 

a 


27. Partial proof: If c and d are odd, then so is cd. Hence, 5 += — be 


S is closed under addition. 0 € S since, for example, 0 = 0/5. Use Theorem 3.2. 
As to S being a field, what is the solution of (2/7)x =1? 


E S, and 
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31. 


35. 
37. 
39. 


colt eg)" 


_{k 0 _fa b 
(b) If K = ie °) and A = ( P), then 


ro E E ja 
ke kd} \ck dk) \e dJ\O k) ~” 
Consider R = £,, S = #, and examine the table in the answer to Exercise 15(a). 


(a) Copy the proof used for M(R) in Example 6. 


The proof that Q( V2) is a ring is essentially the same as in Exercise 13. The hint 
shows how to verify that the solution of (r + sV/2)}x = | is actually in Q( V2). 


41. (b) Partial proof: uf i ) is a right identity, then 
a a\(x x\_ (a a 
b b y) \b b 

ax + ay as _ fa 5 
bx+by bx+by) \b b 

fen +y) a(x+ A z 4 

b(xt+y) blx+y)) \b b) 

This last equation holds only when x + y = 1. 
43. (b) Since H is contained in the ring M(C), its addition is commutative and 


associative, its multiplication is associative, and the distributive law holds. So you 
need to verify only that H is closed under addition and multiplication, that the 
zero and identity matrices are in H, and that the negative of every matrix in H is 
also in H. 


Section 3.2 (page 66) 
1. (a) a — ab + ba — b. 
3. (b) 0,1,4,9 


5. (c} No. Suppose u is a unit in R with inverse «~! and v is another inverse of u. 
Then uv = lp, so that u uy = 0 p Which implies that v = u!. Hence, there is 
(@ +4bd 4ad+ io 3 


only one inverse. 
jar S\bc+tad 4bd+ ae 


) e S. Verify that S is closed under subtraction and 


on (a 4b\fe 
9. Closure under multiplication: ( E wa \( d 


ac + 4bd 4(ad + be) 
yh +be ac+ 46d 
apply Theorem 3.6. 
Sis nonempty since 0g E S (Why?). If r, sES, then by definition mr = 0p and 
ms = Op. Hence, m(r — s) = mr — ms = 0p — 0g = 0g. Sor — s E S. Similarly, 
by Exercise 23, m(rs) = (mr)s = Ops = 0g. So rs € S. Therefore, S is a subring by 
Theorem 3.6. 


(b) Many possible examples. Almost any pair of invertible matrices in M(R) will 
provide an example. 


1}. 


15. 
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17. If ub = Og and u is a unit with inverse v, left multiply both sides of ub = Og by v 
to conclude that b = Op. If cu = Og, a similar argument (with right multiplication 
by v) shows that c = Og. Thus, there is no nonzero element whose product with u 
is Op and, hence, u is not a zero divisor. 

19. If (a, 5)(c, d) = (lpr, 1s), what can be said about ac and bd? 

21. ab = ac is equivalent to a(b — c) = Op. 

25. {a) See Exercise 21 of Section 3.1 (to which the answer is “yes”). 
(b) Consider 1g lg and |slcand use Exercise 21. 

27. No. For a counterexample, let b be almost any matrix in M(R). 


31. {a) (a + af = a + a because x7 = x for every x. But (a + a} = (a + a)(a + a) = 
+++ =at+tata+a. 


39. (b) No. You should be able to find a counterexample, 
41. (b) 12 


Section 3.3 (page 80) 

1. The tables for Z, X Z, are in the answer to Exercise 15 (a) of Section 3.1. 

3. If f(a) = f(b} then (a, a) = (b, b), and, hence, a = b by the equality rules for ordered 
pairs Therefore, f is injective. f(a + b) = (a + b, a + b) = (a, a) + (b, b) = f(a) + fb). 
Complete the proof by showing that f(b) = f(a) f(b) and that fis surjective. 

11. Many correct answers, including the following. 

(a) f does not preserve addition; for example f (4 + 9) = V4+9 = V13 = 3.6, 
but f(4) + f(9) = V4 + V9 =2 + 3 = 5. So f(4 + 9) #f(4) + fO). 

{b) {does not preserve multiplication; for example f(2 - 5) = f (10) = 30, but 
SB: SO = 65) = 90. Sof - 5) + JQ + f(5). 

13. Partial proofs: (a) To prove f is surjective, let re R. Then (r, Os) R X Sand 
SG, 0,)) =r. Hence, fis surjective. 

(c) If ais a nonzero element of S, then f((Og, a)) = Og = f (Opr, 05)), but 
(Og, a) # (Opr, Os). Hence, f is not injective. 

17. Surjective: If a + bi is acomplex number, then f(a — bi) = a — (—bi) =a + bi. 
Injective: If f(a + bi) = f(c + di), use the definition of f and the definition of equality 
for complex numbers (Example 11 of Section 3.1) to show that a + bi = c + di. 

21. The multiplicative identity in Z* is 0. If there is an isomorphism f:Z — Z*, 
Theorem 3.10 shows that f must satisfy f(1) = 0. Hence, f(2) = fA + 1) = 
SYA) =0@0=0+0-—1= —-1. Similarly, (3) = f+ 2) = 
SODS) =0 © (-1) = 0 + (1) —1 = —2. What is f(4)? £(5)? f(D? Find a 
formula for f. Then use this formula to show that f is injective, surjective, and a 
homomorphism. 


25. f is not an isomorphism because it is not injective. For instance, 


Aa s)=1=0 ohms s)*(0 o) 


27. (a) Because f and g are homomorphisms, (f° g)(a + b) = f(g(a + b) = 
Sela) + g)) = fE) + S(g6)) = (f° 8)@ + (f° 8)(6). A similar argument 
shows that (f° g)(ab) = (f° g)@): (f° g&)(b). (continues on next page) 
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(b) You must show two things: (1) If f and g are injective, so is f ° g; and (2) if f 
and g are surjective, so is f ¢ g. To prove (1), assume ( f” g)(a) = (f ° g)(b), that is, 
S(g(a)) = f(g(6)). Then use the injectivity of fand g to show a = b. 

3h. Since f(0g) = Os€ T, we see that Og € P; so P is nonempty. Let a, b € P; then 
f(a)€T and f(b) € T. Hence, f(a — b) = f(a) — f(b) € T. Thus, a — b E€ P. A 
similar argument shows that ab € P. Therefore, P is a subring by Theorem 3.6. 

35. (a) Z has an identity and E doesn’t. (c) The rings have different numbers of 
elements, and so no injective function is possible from Z, X Zy4 to Zyg. (e) The 
equation x + x = Og has a nonzero solution in Z X Z, (What is it?) but not in Z. 


37. (b) Since fis nonzero, there exists a € S such that f(a) # 07. Hence, (1s) f(a) = 
S(1s a) = f(a) + Or, which implies that f(1s) # 07 Show that f(ls) is an 
idempotent and apply part (a). 


Chapter 4 


Section 4.1 (page 93) 
L (a) 3x4 +£ 42x74+2 (Qal. 
3. (a) Bete Seta etext et letersi tathet+etxt 1. 
5. (a) ox) = 3x7 — 5x + 8: r(x) = —4x — 6. 
(©) q(x) = 3? + 3x7 + z + 3; (x) =4. 
9. Yes (read the definition of zero divisor and remember that R is a subset of R[x]). 
Lt. The fact that (r + s)(r — s) =? — $ may be helpful. 


13. There exists g(x) € R[x] such that f(x)g(x) = Op (Why?). Suppose g(x) = by + bx + 
. -- + byat (with b, # Og). Multiply out f(x)g(x) and look at the coefficient of 
x**, What must this coefficient be? And what does that say about a. 


15. (b) Add one term to the polynomial in the hint for part (a). 


17. If 0 # b ER, then bE Rix] and | p = bq(x) + r(x). Use the fact that deg b = 0 
to show that r(x) = 0 and q(x) € R. Hence, every nonzero element of R has an 
inverse. 


Section 4.2 (page 99) 
1. If 0p # c EF, then c has an inverse; hence, f(x) = oe” 'f(2)). 
5. (@)x-1 ©2x-1 ©x-i. 


7. Since f(x)| (x + 1) and f(x)| x, f(x) must divide (x + 1) — x = 1. Hence, 
deg f(x) = 0; so f(x) is a constant. 


9. 1,is a linear combination of f(x) and 0; (Why?). What does this imply? 
15. Every divisor of A(x) is also a divisor of f(x). 


Section 4.3 (page 103) 
2 1 5 
oni? nie = rent 
LO #+ +t t3 (c) -ixti 
3. (a) x7 + x + 1; 2x7 + 2x + 2; 3a? + 3x +3; 4x? + 4x +A. 
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7. (=) Suppose f(x) is irreducible and g(x) = ¢f(x), with Op # c E F. If g(x) = r(x)s(), 
then f(x) = (c“'r(x))s(x), and, hence, either c~'r(x) or s(x) is a nonzero constant by 
Theorem 4.12. If c~'r(x) is a constant, show that r(x) is also a constant. Hence, g(x) 
is irreducible by Theorem 4.12. 


9 (a) artxtl (cf etle 2042,274 2274441; 
207 + 2x + 1. 
11. If it were reducible, it would have a monic factor of degree 1 (Why?), that is, a 


factor of the form x + a with a E Z;. Verify that none of the seven possibilities is 
a factor. 


13. (x — 3)(x — 43. 

15. (a) If f(x) €2,[>] is a monic reducible quadratic, thenit nust factor as f(x) = 
(cx + d)(c"'x + e) for some c, d, ee Z, (Why). Hence, f(x) = ex + dee '(x + ec) = 
(x + ax + b) with a = de“ and b = ec. When counting the possible pairs of factors, 
remember that, for example, (x + 2x + 3) is the same factorization as (x + 3Xx + 2). 
Also consider factorizations such as (x + 2)(x + 2). 


23. (a) Proceed as in the answer to Exercise 11, with Z, in place of 7. 


Section 4.4 (page 109) 

1. (a) Many correct answers, including f(x) = x? + x. 
3. (a) No; f(—2) # 0. (c) Yes. 

5. The Factor Theorem may be helpful. 

7. Show that every element of Z; is a root of x’ — x. 
9. In Za [x]: a7 + 1; 27 +4 42;x7 4+ 2x4 2. 


13. (a) If f(x) = cg(x) with c + Og, then g(x) = c!¥(x). Hence, g(u) = Op implies 
f(s) = Og and vice versa. 


15. If x? + 1 is reducible, then x7 + 1 = (x + a)(x + b) for some a, b E Z, (see the 
answer to Exercise 21(a) of Section 4.3). Expand the right side. 


19. (a) If f(x) = (x — a)‘g(x) with g(a) + 0, then f'(x) = k(x — a}~!g(x) + 
(x — a)¥g'(x). If ais a multiple root of f(x), then k = 2 and k — 1 = 1. If aisa 
root of both f(x) and f(x), show that k = 2. 


23. (a) Letn be the maximum of the degrees of f(x), g(x), and A(x). Using zero 
coefficients as necessary, we have f(x) = dg + ayx + * ++ + aX”, g(x) = by + bix + 
200 + By, and A(x) = co + cix + +++ + cpa”. Then in F [x], g(x) + A(x) = (bo + 
bix +e + bra”) + (co + ext eee + gh") = (bo + co) + (bi + x tee + 
(8, + ¢,)x*. Since f(x) = g(x) + A(x) in F[x], we must have a) = bo + cp, a, = bi + cy, 
a, = b, + cn. Therefore, in F, g(r) + A(r) = (& + cp) + (bi + a)r +--+ 
(6, + ¢,)r" = ao + art +++ +a," = f(r). 


29. The proof is by induction on the degree x of f(x). If n = 0, then f(x) is a nonzero 
constant polynomial and therefore has no roots. So the corollary is true for 
n = 0. Now assume that the corollary is true for all polynomials of degree k — 1 and 
suppose that deg f(x) = k. Prove that the corollary is true for f(x) (that is, when 
n = k). [You supply the work here.] Conclude that the corollary is true for every 
degree n. 
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Section 4.5 (page 119) 
1. (a) (Dax + Dæ 2+1) © xxx + JE- NGBx- 1) 
(e) (x + 32x + IDG? + 1). 
3. Use the Rational Root Test. 
5. (a} Letp =2. (c) Let p =.20rp =:3. 
7. (a) Let p = 5 and use Corollary 4.19. 
11. Apply Eisenstein’s Criterion and Corollary 4.18. 


17. A polynomial of degree k has k + 1 coefficients. There are n choices for each 
coefficient except the coefficient a, of x*. How many choices are there for a,? 


19. (a) (x + 2)(x -HŽ + 24? + 4x + 2) 


Section 4.6 (page 123) 
1. (a) 1-2514+253;-2 (e) 3428 3-25 -14+%-1-i. 
3. (a) x4 — 2 in Qla]; (x7 + V2)(x + V2)(x — 47) in REA: 
(x — V(x + Vix + (x — V2) in Ch]. © x- DE? -— 5) in Op} 
(x — D(x + V5)(x — V5) in R[x] and Cfa]. 


5. Nonreal roots of f(x) occur in pairs by Lemma 4.29. 


Chapter 4 


Section 5.1 (page 129) 


1. (a) f(x) = gŒ) (mod pa)) (b) fx) = g(x) (mod p(x) 
(c) f(x) * g(x) (mod p(*)) 


3. There are eight congruence classes. 

5. Use Corollary 5.5. 

7. Each congruence class can be written in the form [a], with ae F. 
9. 


. See the answer to Exercise 13 of Section 2.1 with f(x) and g(x) in place of a and b. 


Section 5.2 (page 134) 


1. 

+ 0 0] [x] [Ix+ 1] [x] bê +1] [2+ x] +x +1] 
(0) (0) 0] b] +1] på +1] etx) Aata 
ay) [1] (U) k+ [x] p+ 1] p3 be+x+1]) [+x] 
e] b] +] 9] 0] B +x) p +x+1] Bb?) [?+1] 
[x+1] [ix+1] e] 0] O] +x +1] b? +2] bê +1] [x 
eal ix) [x7 + 1) be? + x] e+xt+)) 0 1 [x] [x+1] 
[b+] [P+] [Eg] bW +txti ta [i] IQ [x+]] [x] 
+a [xP +3] be +x+ pe i? +1) [x] [+1] o 0] 
p++] | p++] ta W+ i] +1] [x] 0] (0) 
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. 0 n e] [+1] [x7] [e+ 1] +a Lê+x+1] 
1) 0 0 [0 0 0 0 (0) 0 
0] 0 0 ie] [x+]] e3 p+] [2+ x] [e+x+1] 
ix) w [x [x] pat x [x+ 1] i] bet+xt+ tl] e+] 
t+] 0 (+1) +a] +1 LB+x+1] pÂ i] fl 
[x7] U A [x+ 1] [2+x+1] [+a] [x] +1] i] 
[+ 1] @ e+ (] eA E] K +x+1] [x+1] +a 
B+ x] O +g L’+x+1] [1] L+ 1] [x+]] J e3 
++I] |0 ++ p+ bd i] +x] Â [x+]] 
3 + | (0) [1] [>] [x + 1] 
[0] [0] [1] [a] [x+ 1] 
[1] [1] [0] w+ fl 
W] | k+ O [1] 
[x+1] | [x+] bl [1] [0] 
* 9 [1] [x] [x+ 1] 
[0] [0] [0] [0] [0] 
[1] [0] [1] [x] [x +1] 
[x] [0] b] [1] [x + 1] 
[x+ 1] | 0) [k+] B+ O 


7. [ax +b] + [ex + d] =[(a+ Ax + (b + d)]; 
[ax + b][cx + d] = [(ad + box + (ac + bd)]. 


11. Consider the product of [a] with itself. 


Section5.3 (page 138) 


1. (a) Field (Use Corollary 4.19 and Theorem 5.10.) 
(c) Not a field. (Show that x* + x7 + 1 is reducible) 


3. By Corollary 5.5, the distinct elements of F[x}(x — a) are the classes of the form 
[e] with ce F. Use this to show that F[x](x — a) is isomorphic to F. 
5. (a) Verify that the multiplicative inverse of r + sV3 is” = “V3, where t= P — 37. 


7. By Corollary 5.12, there is an extension field K of F that contains a root c, of 
S(x). Hence, f(x) = (x — epg(ax) in Kx]. Use Corollary 5.12 again to find an 
extension field L of K that contains a root c, of g(x). Continue. 


9. (a) Use Corollary 4.19 and Theorem 5.10. 
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Chapter 6 


Section 6.1 (page 148) 


1. To see that K is not an ideal, consider what happens when you multiply a 
constant polynomial by a polynomial of positive degree. 


(a) If re Rand IgE, then r =r ° lpel. Hence, RE and thus R = F. 


11. (a) (0) ={0} and (1) = (2)= @=@=2Zs, © 0) = (0; 0) =(= 0 = 
(11) = Z1» (2) = (©) = (10) = {0, 2, 4, 6, 8, 10}; (4) = (8) = {0, 4, 8}; (3) = (9) = 
{0, 3, 6, 9}; (© = {0, 6}. 


13. No; see the answer for Exercise 11. 


17. (a) Z N J contains 0g (Why?) and hence is nonempty. If a, be ZN J, then a, bef, 
so that a — b is in J by Theorem 6.1. Similarly a — b E J. Henœ, a — bE A J. 
Now show that if r € R, then ra eI N J and rae INJ. Apply Theorem 6.1. 


27. Use Theorem 6.1. K is nonempty because f (0x) = 05 by Theorem 3.10, and, 
hence, 0, EK. If a, be K, then f(a) = Og and f(b) = Os by the definition of K. To 
show that a — b E K, you must prove that f(a — b) = Og. If rE R, you must prove 
that f(ra) = 0, in order to show that ra E K. 


29. An element of (m) N (n) is divisible by both m and n; hence, it is in (mn) (see 
Exercise 17 of Section 1.2). 


31. (=) If (a) = (b) = (0g), show that a = Og = b and, hence, a = bu with u = 1p. If 
(a) = (b) # (Op), then both a and b are nonzero and a = a> lg E (a). Therefore, 
a E(b), so that a = bu for some ue R. Similarly, b = av for some v E R. Hence, 
a= bu = avu, which implies that uv = 1p (Theorem 3.7), so that v is a unit. 


35. If J + (3), show that J contains an element b such that (3, b) = 1. Use Theorem 1.3 
to show that 1 € Jand, hence, by Exercise 9(a), I = Z. 


41. (a) See Exercise 27 in Section 3.1. 


43. (b) If f(x) € Z[x] has constant term e, then x divides f(x) — c, so that f(x) = 
c (mod J) by part (a). Hence, f(x) + J = c + J by Theorem 6.6. If b, ¢ are 
distinct integers, then b — ¢ cannot be divisible by x (Why?). Hence, b — c € J 
and b + c (mod J). Therefore, b + J + c + J by Theorem 6.6. 


47. Half proof: Suppose that u eS. If u? = u and S = (u), then S is a subring since it 
is an ideal. If s € S, then s = ru for some r € Z,. Hence, su = (ruju = ru? =ru =s. 
So u is the identity element in S. 


2 


Section 6.2 (page 159) 


3. By Exercise 10 in Section 6.1, the kernel of fis either (0z) or F. Explain 
why it cannot be F. Hence, fis injective by Theorem 6.11 and, therefore, an 
isomorphism. 


5. Consider the case when R = Z and Tis the principal ideal (#). Then Z/f is just 
Z, Is Z, always an integral domain? 


7. Apply the First Isomorphism Theorem to the identity map from R to R. 


0 0 
9. (b) The ideal consisting of all matrices in R of the form ( b y with b, c 
integers. G 
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13. Half proof: Let a+ IE R/L If there is an element b E R such that a — b? E1, 
then a = $? (mod }). Soa + J=& + I = (b + D(b +1) by Theorem 6.6. Hence, 
b + Iis a square root of a+ Jin R/I. 


17. (a) fatb)=(atb)t+ha@tbdbt+)N=H(@tN+Ot+),@t)N+Ot+N)= 
@+Ia+J)+ (b+ I, b+ J)= f(a) + f(b). A similar argument shows that 
Jab) =f@fe). IAT 

21. Let fZ > Z, be given by f([a]9) = [a]s, where [a], denotes an element of 
Z,, First, show that fis a well-defined function (independent of the choice 


of representative in the congruence class). Then show that fis a surjective 
homomorphism of rings with kernel (5). Apply the First Isomorphism Theorem. 


2%. If r + Jisa nilpotent element of R/J, then for some n, we have Og + J = (r + JF = 
r" + J. Hence, r" € J (Why?), which means that r” is nilpotent in R. Hence, (r°y” = 
Oz for some m. But this says r € J, and, hence, r + J is the zero coset Og + J. 

29. Define a function iS > R x R byf i i = (a, €). Show that f is a surjective 


homomorphism of rings with kernel Z. Apply the First Isomorphism Theorem. 


Section6.3 (page 166) 


1. By the definition of composite, n = cd with 1 < |c] < |n| and 1 < |d| < |n|. Hence, 
c and d cannot be multiples of n. Thus cd = n E (n), but c € (n) and d Ẹ (n). 
Therefore, (n) is not a prime ideal. 


bod 
` 


(a) Use Theorem 2.8 to show that p is prime if and only if Z, isa field. But 
2, = £/(p); apply Theorem 6.15. 


5. The maximal ideals in Z; are {0, 3} and {0, 2, 4}. 


7. If Risa field, use Exercise 10 of Section 6.1. If (0p) is a maximal ideal, use 
Theorem 6.15 and Exercise 7 of Section 6.2. 


9. If p = cd, then cde (p). Since (p) is prime, either c € (p) or dE (p), say c € (p). 
Hence, ¢ = pv for some ve R. Use this and the fact that p = cd to show that d is 
aunit. 


15. (b) M is not prime because, for example, 3-7 = 0 € M, but 3 € M and 7 € M. 


17. Iis an ideal by Exercise 22 of Section 6.2. Use the fact that J # S (Why?) and 
surjectivity to show that J + R. If rs € J then /(rs) €J. Hence, f(r)f(s) € J (Why), so 
that f(r) €J or f(s) €J by primality. Therefore, r EJ or s€ J, and, hence, Jis prime. 


19. (=) Suppose R has a unique maximal ideal M. Then M + R by definition, and 
so M is contained in the set of nonunits by Exercise 9 of Section 6.1. If cisa 
nonunit, then the ideal (c) + R (Why?). So (c) is contained in a maximal ideal by 
hypothesis. But M is the only maximal ideal. So c € (e) © M. Since every nonunit 
is in M, the set of nonunits is the ideal M. 


Chapter ? 


Section 7.1 (page 180) 


r 2 oy’ 2 3 Lea wea 
i G 3 i) =(3 1 >) and (3 l ) =f 3 +), Bach of the other 


permutations is its own inverse. 
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3. (a) 18 (c) 24 (e) 6. 


2 0 
5. (a) “i 1) O (; s) 


9. 0 ro rı r2 s t u 
To To ri ry s t u 
ry ri rn To u s t 
ry ry To rı t u s 
s s t u To ri ry 
t t u s r2 To r 
u u s t ri ry To 


13. S; X Zis nonabelian of order 12 and D4 X Zis nonabelian of order 16. 


17. (a) Gis a group. Closure: If a, b EQ, then a + b = a + b + 3 EQ. Associativity: 
(as b)ac=(a+b+3)»c=(a+b+3)+c+3=a+b+c+6= 
a+(6+c+3)+3=a+(+c+ 3)= a» (b » c). Verify that —3 is the 
identity element and that the inverse of a is —6 — a because a » (—6 — a) = 
a + (—6 — a) + 3 = —3 and, similarly, (—6 — a) * a = —3. (c) Gisa group 
with identity 0. The inverse of a is —a/(1 + a). 


19. No; there is no identity e satisfying both a + e = a and e « a = a for every a. 


23. Most of the argument in Example 15 of Section 7.1.A can be carried over to this 
situation by replacing “+ 0” by “= 1” throughout. To show that the inverse of a 
matrix in SL(2, R) is also in SL(2, R), use the formula for the inverse of a matrix 
(in Example 7 of Section 3.2 and in Example 15 of Section 7.1.A). 


27. If ab = ac, then b = eb = (a~'a)b = a`!(ab) = a“(ae) = (a 'a)e = ec = c. 

31. Leta, b,c be distinct elements of T. Let o € A(T) be given by a(a) = b, a(b) = a, and 
a(t) = t for every other element of T. Let T € A(T) be given by 7(a) = b, 7(6) = c, 
7(c) = a, and T(t) = t for every other element of T. Verify that (ø ° T)(a) = a and 
(7° a)(@) = c; hence, a ° T #T°O. 


Section 7.2 (page 201) 


le = ee = (Coc = ec =c. 


iL esc 
5. If f(a) = f(b), then a`! = 67! Hence, (a~!)~! = (b71)!. Therefore, by Corollary 7.6, 
a = (a")* = (6)! = b. Thus fis injective. Corollary 7.6 can also be used to 

prove that fis surjective. 
7. (a) 2 {(c) 6. 
9. (a) Ui has order 4; Uy, has order 8. 

13. If Gisa finite group of order n and a E G, then then + 1 elements a, a, a’, 
a’,..., a" cannot all be distinct. Hence, a’ = @ for some i andj with n = i > f, 
which implies that a“! = e with 0 < i — / = n (Why?). What does this say about |a}? 

17. (a) x = a7 'b is a solution of ax = b because a(a™!b) = (aa“')b = eb = b. If cis 
also a solution, then ac = b = a(a~'b). Hence, c = a~'b by Theorem 7.5(2). 

27. If a, b €G, then by hypothesis, aa = e, bb = e, and abab = e. Left multiply both 
sides of the last equation by ba and simplify. 
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29. Let x = a~'cb™ and show that axb = c. To prove uniqueness, assume ayb = c 
and show that y = a`'eb™'. 


t2 3 k 23 F 
31. (b) In Sy leva = (; l 3) ana = (1 3 >). Verity that jo = 2, = 2, 


2r 3 
33. Let ja] = m and |b| = n, with (m, n) = 1. If (aby = e and ab = ba, then a*b* = 
(abf = e, so that a = b™%. Hence, a” = (6-*)" = (6")* = e. Therefore, m |kn 
by Theorem 7.9 and, hence, m |k by Theorem 1.4. Similarly, n |k. So mn | k (see 
Exercise 17 of Section 1.2). 


35. ab = b'a—> aba! = b= aba! = ba ‘Yaba™')(aba') = (bt = b? = e 
(because bÉ = e) => ab} = a= b = e. Therefore, ab = la = bba = eba = ba. 


ab = G 2 a and (ab) = ab. 


Section 7.3 (page 211) 
1. (a) (1) = Uys (2) = (8) = {1, 2, 4, 8}; (4) = {1, 4}; (7) = (13) = {1, 4, 7, 13}; 
11) = £1, 11}; (14) = {1, 14}. 
5. (2) ={..., —8, —6, —4, —2, 0, 2, 4, 6, 8, ..-} 
1 E l 
7. Q)= re: 1, 2, 4, 8, 16,. a 
9. 1=242=2;4= 24,7= 1358 = 2; 11= 2- 13;13 = 13t 14 = 23-13. 
11. Using additive notation, we see that the group is cyclicwith generator (1, 1): 
1(1, 1)=(1, 1} 2(1,1)=(0,2% 3(1,1)=(1,0; 4(1, 1) = (0, 1); 5(1,1)= 
(1,2); 6(1, 1) = (0, 0). 
13. Since eg is the identity in H, eyey = eg. Apply Exercise 1 of Section 7.2 with c = eg. 
15. (a) If a, beH N K, then a, b € Handa, be K. Since His a subgroup, abe H 


and a'e H. Similarly, ab € K and ae K. Hence, abe HN Kandate Hk. 
Therefore, H N K is a subgroup by Theorem 7.11. 


29. Since H is nonempty, there is some c € H. By hypothesis, e = cc € H. If de H 
then since e € H, we have d = ed“ € H. Use this and the fact that d = (d7)" to 
show that c, dE H implies ede H. Apply Theorem 7.11. 


31. If xax and x bx Ex" with a, b € H, then ab € H, and, hence, (x ax)(x"'bx) = 
x'(ab)x €x" Hx. Show that (xx)? = xta 'x ex Hx. Apply Theorem 7.11. 
33. Theorem 1.2 may be helpful. 
35. (=) If a is in the center of G, then ag = ga for every g € G. Hence, Cia) = 
{g E Glag = ga} = G. 
41. If a’, b" € H, then since G is abelian, a"b” = (ab)" € H. Also (d')' = a” = 
(a*y € H. Apply Theorem 7.11. 
43. The subgroups of Zy are {0}, {0, 6}, {0, 3, 6, 9}, £0, 4, 8}, {0, 2,4, 6, 8, 10}, and Zp. 
47. See Exercise 33 of Section 7.2. 


49. G = (a) = {na |nE Z}. Assume that ge Gis a solution of x + x = a. Then g = ka 
for some integer k. Hence, ka + ka = a, which implies that a has finite order 
(Why?). This is a contradiction, so x + x = a has no solution in G. 

53. If (m,n) = 1, use Exercise 47. To prove that if Z,, x Z, is cyclic, then (m, 2) = 1, 
we prove the equivalent contrapositive statement: If (m, n) # 1, then Z,, x Z, is not 
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cyclic. If (m, n) = d > 1, then m = dr, n = ds, and drs < mn. If (a, b) EZ, X Zn, 

then drs(a, b) = (drsa, drsb) = (sma, rnb) = (0, 0). Therefore, the order of 

(a, b) is a divisor of drs (by Theorem 7.9 in additive notation) and, hence, strictly less 
than mn. So (a, b) does not generate Zm X Z, (a group of order mn) by Theorem 7.15. 


57. (a) Show that Uyg = {1, 5, 7, 11, 13, 17} is generated by 5. 


Section 7.4 (page 223) 

1. (a) Homomorphism: f(x + y) = 3(x + y) = 3x + 3y = f(x) + f(y). Surjective: 
If z ER, then f(t/3) = 3(¢/3) = t. Injective: If f(x)= f(y), then 3x = 3y, and, 
hence, x = y. 

5. gisa homomorphism since for any a, b, g(a + b) = 2(a + b) = 2a + 2b = g{a) + g(b). 
You can easily compute f (0), fQ), . . . , f(8) to see that fis injective and surjective. 

7. fis a homomorphism since for any a, b, f (ab) = |ab| = |allb| = f(@)f(6). Why isf 
surjective? 


; PO _fi NA o\_ fl o0). 

11. g is ahomomorphism since for any a, b, 9(a)g(b) = ( 0 ie iA) = i T = 
g(ab). If g(a) = g(b), then f a = ( a! which implies that a = b. Hence g 
is injective. Gi 

13. Show that both groups are cyclic of order 4 and use Theorem 7.19. 

15. f(a®) = f(eg) = eg = f(a)’. For positive integers, use induction: f(a?) = f(a) = 
f(a)’. if f(a’) = f(a}, then f(a*™*)) = fata’) = faf 0) = f@fa) = fay*’. 
Hence, f(a") = f(a)" for all n = 0. What about negative n? 

19. (=) If Gis abelian, then fis a homomorphism because f(ab) = (aby! = bat = 
a'b7 = f(a) f(b). In this case, f is an isomorphism by Exercise 5 of Section 7.2. 

21. Because f and g are homomorphisms, (g ° f)(ab) = g{ f(ab)] = gLf(a)f(b)] = 


8(F@)a( FO) = (g ° fa) (g °F). Hence, g ° fis a homomorphism. If 
c EK, then since g is surjective, there exists b € H such that g(b) = ce. Since fis 


surjective, there exists a € G such that f(a) = b. Thus, (g ° fXa) = g( f(a) = 
g(b) = cand g° fis surjective. To complete the proof, show that f is injective. 

29. If a” = eg, then by Exercise 15 and Theorem 7.20, f(a)" = f(a") = f(ea) = ex. 
Similarly, if f(a)" = ey then f(a”) = f(a)" = ey = f(eg). Hence, 2" = eg since fis 
injective. So a" = eg if and only if f(ay’ = ey. 

31. If a, b EF, then because fis a homomorphism, /(ab) = f(a)f(b) = ab. So ab EF, 
and F is closed under the group operation. Use Theorem 7.20 to show that the 
inverse of every element of F is also in F. Then use Theorem 7.11. 

35. K,= {1, 4}. 

37. If f, g € Inn G, then f(a) = cae and g(a) = d—1ad for some c, d. Show that 
(f ° g)(a) = (dey 'a(de) and, hence, f° g E Inn G. Show that the inverse function 
h of fis given h(a) = cac™? = (cae € Inn G. Use Theorem 7.11. 


41. See Example 6. 


43. Verify that every nonidentity element of U; has order 2 but that this is not true 
for Uio- Hence, there is no isomorphism f by Exercise 29. 
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51. (a) If 0.(x) = 0,(y), then xe“! = ye~’. Hence, x = y by Theorem 7.5. Therefore, 0, 
is injective. If x € G, then xc € G and 6{xc) = (xe)e"! = x. Hence, 8, is surjective. 

59. (a) Show that 4 and v both induce the same inner automorphism (that is, hah = 
wav for every a € D4). Do the same for rg and m, for r, and rz, and for d and t. 
Then show that the inner automorphisms induced by A, rp, r;, and d are all 
distinct (that is, no two of them have the same action on every element of D4). 


Section 7.5 (page 233) 
ł. (a) (173) (c) (1476283). 
3. (a) (12)(45)(679) (c) (13)(254)(69)(78). 
5. (a) 2 (c) 4. 
7. {a} odd (c) even. 
9. (a) 3 (c) 60. 
11. There are eight 3-cycles (list them), each of order 3. Each of (12)(34), (13)(24), 
and (14)(23) has order 2. The identity (1) has order 1. 
15. (ajaz + + + ay) = (ayay)(Qyag_1) + + * (a1a4)(aya3)(a;a2). There are k —1 transpositions 
(one for each of a), a3,..., @). k — 1 is even if and only if k is odd. 
19. Suppose T = 0,0, ++ * g „ where the g; are disjoint cycles, with g, having order 


Ki, 02 having order k,,..., and g, having order k,. Show that 7" = (1) if and 
only if ø; = (1) for every i. Use Theorem 7.9 to show that k;| n for every i. 


23. Use Theorem 7.12. 


25. Verify that ra = 0 '7; use this to show that any product of powers of g and 
powers of 7 is one of: f, €’, 0°, a° = (1), T, OT, O77, OF °F. 


29. There are three possible cases (where a, b, c, d are distinct symbols): (2b)(ab), 
(ab)(ac), and (ab)(ed). But (ab)(ab) = (1) = (abe)’; (abXac) = (ach); and (ab)(ed) = 
(acb)(acd). 


35. Let = (ab) and express g as a product of disjoint cycles. Since disjoint cycles 
commute by Exercise 18, all cycles ing7a~! not involving a or b will cancel and 
oTo™ will reduce to the form «(ab)«", where « has one of the following forms (in 
which a, b, x, y, u, v are distinct symbols): (+ + + xaby + + +); (-* + xbay ++ +); 

(: -- xay * +- ubv- -XC "+ xay***)3(- «eub =»); or {ee xay t ss uby ss) 
Verify that x(ab)ic is a transposition in each case. 


39. (a) The argument used in Exercise 24(a) and (b) can be used here if S, is 
replaced by G, (12) is replaced by 7, B, is replaced by the set of odd permutations 
in G, and A, is replaced by the set of even permutations in G. In the Hint for 
Exercise 24(b), replace (12) by 7", which is odd (Why?). 


{b) See Exercise 24(c) and replace |S,] by |G]. 
{c} Use part (b). 


45. The idea is to find an injective homomorphism S, —> 4,,, and then apply part (4) 
of Theorem 7.20. First, note that any permutation in S, can also be considered as 
a permutation in S,+2. Let a be the transposition (n + 1, n + 2) in S,42. 
Define £:S, —> A, as follows. If g is odd, then f(a) = oa. If ø is even, then 
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f(T) = a. To show that fis a homomorphism, suppose that g and 7 are in S,. 
Consider four cases: (1) g and 7 are both even: (2) ø is even and 7 is odd; (3) g is 
odd and r is even; (4) g and 7 are both odd. Show that f(a7) = f(a)/(r) in each 
case. To show that fis injective, you must show that f(a) = f(T) implies that g = 7. 
Prove it in cases 1 and 4 and show that f(a) = f(T) cannot occur in cases 2 and 3, 


Chapter 8 


Section 8.1 (page 245) 
1. @) If Ka = K, then a = ea E Ka = K. So ae K. 
3. Kro Z {ro, Fi: 2; ra}; Kd = {d, h, t v} 


7.4 9. 1 11. 6. 
17. (a) 1,2, 3,4,6,8, 12,24 (c) l, 2, 4, 5, 8, 10, 16, 20, 40, 80. 
19. 27, 720. 


21. H N K is a subgroup of H and of K, and so its order must divide p by Lagrange’s 
Theorem. Hence, |H N K| is either 1 (in which case H N K = (e)) or p (in which 
case H = HN K= K). 

23. If e + aE G, then (a) is a nonidentity subgroup of G. Hence, G = (a). If |G] = |a| 
has composite order, say |a| = td, then (a’) is a subgroup of order d by Theorem 7.9. 
Use Theorem 8.7. 


25. 2. 


31. List the element of G in pairs: a, a`}; b, b7}; c, c~", etc. witha + a`’; b #7; 
c + c~}; etc. for aslongas possible. Use the fact that there is an odd number of 
nonidentity elements to show that at some point you must reach a nonidentity 
element k such that k = k~". What is the order of k? 


35. A proper subgroup has order », with 1 < # < pq and na divisor of pq. Use Theorem 8.7. 


41. If Gcontains no element of order 3, show that every nonidentity element has 
order 11. Apply Exercise 40, with p = 11. What do you conclude? 


Section 8.2 (page 252) 


le a b 
5. orf t) ew and (4 P) EG, then 


Ga ede ge tle ew 
aks -bfa f b+ =i te) ew, 


o idjo d J) \o 1 


7. G* = G X {e} is a subgroup by Exercise 16 of Section 7.3. It is normal by 
Theorem 8.11 since for any (c, d) € G x H and (a, e) € G*, (c, d) (a, e)(c, d) = 
(c, d'a, e(c, d) = (cac, ded) = (cae, e) EG. 

11. If c EG, let fbe the inner automorphism given by f(x) = c~!xe (see Example 9 
of Section 7.4). Since N is characteristic, f(N) SN, that is c7! Ne S N. Hence, N 
is normal by Theorem 8.11. 
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13. See Example 9 of Section 7.4 and Theorem 8.11. 


17. First, prove that K is a subgroup of G. To show that K is normal, we show that 
for anya € Gand k EK, a 'kacK: 


Saka) = fa) fky¥@ [f is a homomorphism.] 
=f@'fO/@ [Theorem 7.20] 
= fay eg f(a) [ke K] 
= fay 'f(@) = en. 
Therefore, a~'ka € K and K is normal by Theorem 8.11. 


19. Use Exercise 15 of Section 7.3 to show that NN K is a subgroup of K. If ge K 
and nE NN K, then ge G, nEN, and, hence, g~'ng€ N by the normality of 
Nin G. But n EN N K implies that n € K, and, hence, g~!ng € K by closure in 
K. Therefore, g ng E€ N N K, so that g (NN K)gSN 1 K. Hence, N N Kis 
normal in K by Theorem 8.11. 


21. If ne Nand k EK, use normality to show that K"(n-'ka) = (k'n ka is in 
KNN=€). 


23. (a) If a EN, then Ne = N and Na are disjoint cosets (Why?). Since [G:N] = 2, 
these two cosets contain all the element of G. Therefore, any element that is not 
in N must be in Na. 


27. Partial proof: If N is normal and ab = n E N, then ba = babb™ = bnb“ and 
bnb € N by normality. 


29. Let N = (a). Then H = (c*) for some k by Theorem 7.17. If g € G, then 
g lag € N by normality; hence, g~'ag = a‘ for some t. Consequently, for any 
aM EH, Bldg = (g"'ag)" = (a°)” = (d )'EH. 
35. N isa subgroup by Exercises 15 and 27 of Section 7.3. Show that Nis normal in G. 


37. By hypothesis, the cyclic group (a) is normal. Hence, b~'ab € (a), that is, bab = a* 
for some k. 


Section 8.3 (page 260) 
3. Partial Answer :(Mh)(Mr,) = M(h°r,) = Md; (Mr) (MA) = M(r,° h) = Mt = Md. 


5. Show that Z,,/M is cyclic with generator 1 + M; then show that 1 + M has 
order 6 in &;3/M. 


7. Find the orders of the groups Uz, (5), and U/{5) (see Example 14 of Section 7.1 
or 7.1.A). Use Theorem 8.13 and 8.7. 


9. G/N = 23. 
11. Since ab = ba in G, NaNb = Nab = Nba = NbNain G/N. 
15. The identity element of the quotient group is the coset (0, 0) + «(5, 5) = (5, 5)). 
(1, 0) + (5, 5)) has infinite order since for any positive integer k, k(1, 0) = (k, )E 
((5, 5)). On the other hand, (1, 1) + (5, 5)) has order 5, as you can easily verify. 
19. If bE G, then Nb is a square in G/N, say Nb = (Nc)* = Ne. Since b € Nb, b = nê 
for some n € N. What do you know about elements of N? 


21. If Tg has finite order n, then Tp” = (Tg)” = Te=T, so g" € T. What does this tell you 
about the order of g"? And what, in turn, does that tell you about the order of g? 


23. R*/R** = Z. 
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25. (a) 9,5,7 (b) Ifm,neZ, then n(m/n + Z) =m + Z =0 + Z in Q/Z. 
31. What are the possible orders of Z(G)? Then, what are the possible orders of 
G/Z(G)? Use Theorems 8.7 and 8.15. 


37. Hint: Show that the function f:4/N x B/N — G/N given by f(Na, Nb) = Nab is 
well defined. Then show that if a€ A and be B, then Nab = Nba. Use this fact 
to prove that fis a homomorphism. 


Section 8.4 (page 270) 


1. f(a + bi) + (c + di) =f(atod+6+a))=b+d=fat bi + f(c + di); 
the kernel is Z. 


3. You provide the proof that h is a homomorphism. The kernel is (1) (so A is 
injective by Theorem 8.17). 


5. f(y) + (u, vy) = fŒ + u, y +v) = y + v = f(x, y) + f(u, v); so fis a 
homomorphism. You find the kernel. 


11. If [a], = [6],, then » | (a— b) by Theorem 2.3. Since k | n, it follows that k | (a — b). 
Use this fact to show that [ra], = [rb];. 

13. fis well-defined by Exercise 11. fis a homomorphism because f([a];¢ + [b] s) = 
Jle + bhe) = [a + bh = [a]; + [b] = f((a]ic) + f(4]is). Find the kernel and 
explain why it is isomorphic to Z,. 

17. (a} (0), 2, Za Z, Zo Zy 

19. (e), S3, and Z2. 

21. Kernel fis a normal subgroup of G, so what can it be? What does that imply? 

25. Show that fis a homomorphism. If ¢ is any integer, then f(0, —e) = 0 — (—c) = c; 
hence fis surjective. If (a, b) is in the kernel of f, then a — b = 0 and, hence, 

a = b. So (a, b) = (a, a) = a(1, 1) E (1, 1)). Show that any element of ((1, 1)) is in 
the kernel; hence the kernel is ((1, 1)). Apply the First Isomorphism Theorem 8.20. 

27. Verify that f: G x H —> G/M x H/N given by fla, b) = (Ma, Nb) is a surjective 
homomorphism with kernel M x N. Apply Theorem 8.16 and the First 
Isomorphism Theorem 8.20. 

31. Verify that f: Z —> Z, X Z, given by f(a) = (4];, [4],), is a homomorphism. Use 
Exercise 17 of Section 1.2 to show that the kernel is (12). Use brute force to show 
that fis surjective: Verify that f (1), f @), . ..,f(12) are all the elements of Z; X Z4. 


33. Since H = G/K by the First Isomorphism Theorem, it suffices to construct a 
bijection from the set S of all subgroups of G that contain K and the set T of 
all subgroups of G/K. If B is a subgroup of G that contains K, then B/Kis a 
subgroup of G/K, so define 8: S—> T by 6(8) = B/K. Then @ is surjective by 
Theorem 8.24. Show that @ is injective. 


Section 8.5 (page 277) 

1. (a) (123), (132), (124), (142), (134), (143), (234), (243). 
3. (1). 
5. Theorem 7.23 and Example 6 of Section 7.5. 


9. If Ñ + (1), then N contains a nonidentity element ø. If r # (l) is in N, then 
oo = (1) = ørt implies that g = r by Theorem 7.5. Hence, N = {(1), 7}; and N 
is cyclic of order 2. 
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Chapter 9 


Section 9.1 (page 285) 

3. (a) {(0, 0)}; {(0, 0), (1, 0)3; {(0, 0), (0, 1}; {(0, 0), (1, 1}; Zz X Z. 
5. Z, X Z 

9. No. 


13. (b) If Dis normal, then for any a, b E G, (a, e, e)(b, b, b)(a, e, eo ED. But 
(a, e, e)(b, b, b)(a, e, ey * = (aba™", b, b). Since this is in D, we must have aba“! = 
b, which implies that ab = ba. 

23. (a) Let M = ((123)) and N = ((12)) in Sy. 

25. First, verify that N; N (N1 <- N-Na ` © Na) =e) implies that when / + j, then 
N, A N, = {e} because N, SN, - ~- NiNa +> Ng. Use the homomorphism fin the 
proof of Theorem 9.1. If f(a, . . . , ax) = e, then a, = (a; - - - a;i) elaz oo a). 
Use Lemma 9.2 and Corollary 7.6 repeatedly to show that 
a EN; NN, +++ NNi e Ny = (e). Hence, fis injective by Theorem 8.17. 


27. (a) What are the normal subgroups of S3? 


Section 9.2 (page 297) 

1. If p'a = 0 and pb = 0, then p'(—a) = —(p’a) = 0 and p™*"(a + b) = p'p"(a + b) = 
Ppa) + ppb) = 0. Hence, a + bE G(p) and —a E Gp). Use Theorem 7.11. 

3. 0) Z02;202,:.02, ©2:0402, © 4020202; 
7,04,02, (2) 2,02,02,04,02,042;2,04,02,02,025 
2,02,04,0252,0£,0£4,04,025 204024, 0 Zs 2,0 2,0 £5. 

5. (a) 2,5 (c) 2, 2, 22, 23,3, 5, 5, 5, 5. 

7. (a) 2,2and2,2 (c) 2, 2% and2, 2% 

9. (a) G must contain an element of order p (Why?). If a has order p, then pa = 0. 


13. If q is a prime other than p and if q divides |G], use Exercise 12 to reach a 
contradiction. 


19. (a) Exercise 1 is the special case when every element of finite order has order a 
power of p. Essentially the same proof works here. 


Section 9.3 (page 302) 


3. {(12)(34), (13)(24), (14)(23), (1)} is the only Sylow 2-subgroup. The four Sylow 
3-subgroups are ((123)), (129), ((134)), ((234)). 


§. (a) lor4. 


7. (a) Show that G has a normal Sylow 7-subgroup. (c) Show that G has a 
norma! Sylow-11 subgroup. 


9. If a €G, then (Na) = Nin G/N, so that a EN. 


13. For each prime that divides |G], there is exactly one Sylow subgroup by the 
Second Sylow Theorem. Let p}, P» - . . , Pa be the distinct primes that 
divide |G], and let Ni, N2, ..., Ny, be the corresponding Sylow groups. Define 
SN, X Nz X +++ X Ng— Gby f(a, ag, ..., Ag) = aaz * - a. The proof of 
Theorem 9.1 shows that fis a homomorphism. Then Imf= N,N2---Ny= 
{ayaz + - - ak | aE Nà} is a subgroup of G by Theorem 7.20. The Sylow subgroups 
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of Im falso are N,, No, ..., Ni (Why?). By the definition of Sylow subgroups, 
[im f| = [Nil + [Nal - - [Nx] = [GI]. Hence, Im f = G, and fis surjective. By the 
definition of the direct product, |N, X Na X - <- X NJ = [Nl * (Nd --- [Nd = |G. 
Since N, X N2 X - - - X Ng and G have the same number of elements the 
surjective map f must also be injective (Why?). Therefore, fis an 
isomorphism. 


21. Show that there is a normal Sylow 3- or 5-subgroup. Note that if there are six 
Sylow 5-subgroups, G has 24 distinct elements of order 5 (Why?). Similarly, if 
there are ten Sylow 3-subgroups, G has 20 distinct elements of order 3. 


Section 9.4 (page 310) 
a (a) {ro}, {r2}, {n, rs}, {h, v), {d, t). 
. Look at H = {ro ri, ro, r3} in Dy. 


» (123), (124), ((134)), (234). 
. If Cis the conjugacy class of a E G, show that f (C) is the conjugacy class of f(a). 


oN U w= 


15. In the equation of Exercise 14(c), verify that each |C| is either 1 or a positive 
power of p. At least one |C] is 1 beacuse {e} is a conjugacy class. Since |N] is 
divisible by p, there must be more than one |C} = 1 and, hence, some nonidentity 
element of Z(G) in N. 


19. If b e N(N(K)), then b~'N(K)b = N(K). Hence, 5-'Kb © N(K), since KS N(K). 
Verify that both K and b~!Kb are Sylow p-subgroups of N(K) and, hence, 
conjugate in N(K). But K is normal in N(K), and so b-'Kb = K. Hence, b e N(K). 

2t. If Sis a Sylow p-subgroup containing H (Exercise 24), then every Sylow 
p-subgroup is of the from a~!Sa for some a € G and, therefore, contains a~! Ha. 


Section9.5 (page 318) 


i. First show that p? # 1 (mod q). [If p? = 1 (mod q), then q divides p + 1 or 
Pp — 1 (Why’). Use the facts that p < q and q # 1 (mod p) to show that both 
possibilities lead to a contradiction.] Then use Theorem 9.30. 


5. (a) 

e a a a b ab a’b ab 
e e a a a b ab ab a&b 
a a a a e ab ab ab b 
a a a e a ab a’b b ab 
& o e a a ab b ab ab 
b b ab ab ab a a e @ 
ab ab b a’b a’b a Fad a e 
ah | ab ab b ab e a a a 
ab @b ab ab b a e P a 


7. Use Exercise 13 of Section 9.3 and Theorem 9.9. 
13. {1,-1}. 
17. How many Sylow p-subgroups does G have? Use Corollary 9.16. 
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Chapter 10 


Section 10.1 (page 330) 


3. (a) True. Proof: a |b means b = au and c|dmeansd = cv. Hence, bd = aucv = 
ac(uv). 


5. If ais an associate of b, then a = bu for some unit u. Hence, bu = a = be, and, 
therefore, u = c, a contradiction. 


7. Suppose g = pu, where p is irreducible and u is a unit. Suppose g = rs; then rs = 
px, and, hence, p = (pu)u! = (ys)! = r(gu7!), Since p is irreducible, ris a unit 
or sa`! is a unit by Theorem 10.1. But if su7 is a unit, say su™’w = 1, then sis a 
unit. Therefore, q is irreducible by Theorem 10.1. 

17. (a) 5(ab) = 5((su — tv) + (sv + tu)i) = (su — te)? + (sv + uy = se — 2stuv + 
Pe + oe? + Istuv + Pa? = start Pe t+ te + Pee = (7 + Pu? + ve) = 
6(a)6(5). 

21. If Og +a ER, use Theorem 10.1 to show that a? can’t be irreducible and, hence, 
must be a unit. Hence, a is a unit. 


23. Suppose p = rs. Then p |r or p | s. Show that r or s must be a unit and apply 
Theorem 10.1. 


29. Assume that ô(a) = k for all nonzero ae R. If b # Og, then there exist q, r such 
that 1, = bq + r, with r = 0g or 6(r) < 6(5). The latter condition is impossible 
because ô(r) = k = 6(5). Thus r = 0g, and, hence, gis a multiplicative inverse of b. 


Section 10.2 (page 341) 


1. (ab) S (6) since b | ab. If (ab) = (b), then ab | b, say abu = b. Hence, au = 1p, 
contradicting the fact that a is a nonunit. 


5. See Example 3. 


11. If @ is an ideal other than R, then a is not a unit (Why?) and, hence, must be 
divisible by an irreducible element p (Theorem 10.12). Hence, (a) € (p), with (p) 
maximal by Exercise 10. 


13. (b) Verify that f: Z — Z, given by f(a) = [a], is a surjective homomorphism. 


15. By Theorem 10.8, Z = (b) for some nonzero b. If a € Zi], then a = bg + r with 
r = 0 or &(r) < (b), and, hence, a = r (mod J). By Theorem 6.6, the number 
of distinct cosets of J (congruence classes mod J) is at most the number 
of possible r’s under division by b. Show that there are only finitely many 
possible r’s. 


21. By Exercise 20, d = au + bv for some u, vE R. If e € Sis acommon divisor of 
a and b, then e necessarily divides d. Hence, d is a gcd of a and bin S. 


29. For some d, bc = ad. If a = ryrz: + + ry, d = 2422 ++ * Zw b = pipa * * * Pa and 
C= qiq: ++ q With each Pa qa Fe Zp irreducible, then pipz °  - p92 ** * qi = 
rira- 1222+ ++ Za: So each r; is an associate of p, or g,. But r; cannot be an 
associate of any p; (otherwise r; would divide the gcd 1px of a and b, which implies 
that the irreducible r; is a unit). 
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Section 10.3 (page 351) 

1. If x =a, y= 6, z = c isa solution of x” + y" = 2 anda = kt, show that x = a’, 
y = 6,2 =e isa solution of x* + y* = z*, contradicting the hypothesis. 

3. N(ab) = N((rm + snd) + (rn + sm) Vd) = (rm + snd’ — d(rn + sm? = Pm? + 
Ynnrsd + Prd? — drw — 2mnrsd — dên? = Pr? + SRE — dP — de = 
Q? — dee? — dr?) = N(a)N(6). 

9. (a) Use Corollary 10.22. 

17. (> Leta = u + v V—Sandb=wt+2zV-5S. lf r+ sV—SeP, thenr +s V—-5= 


2a + (1 + V—5)b = 2(u + vV—5) + (1 + V—S\w + 2V—5) = Qu + w — Sz) + 
(2u + w +z) V—S. Hence, r — s = (2u + w — 5z) — (2v + w + z) = 2(u — v — 32), 
so that r = s (mod 2). 


Section 10.4 (page 358) 
1. (2) [a, b] = [ak, bk] because a(bk) = b(ak). 
3. [a, 1g) + [b, 1r] = [ale + 1nd, Lele] = [a + 6,12] € R* and [a, 1p)[b, Lal = 
[ab, 1 pl r] = [ab, 1p] E R”; hence, R* is closed under addition and multiplication. 
The zero element [0g, 1] of Fis in R*. The negative of [a, 1p] is [—a, lgJER*. 


5. Verify that fF — (r + si|r, s €Q} given by f([a + bi, c + di) = Gee + 


2 
be + a ` : etd 
ok zisan isomorphism. 


11. mu + nv = 1 for some integers u and v by Theorem 1.2; u and v may be negative. 


Negative powers of a are defined in Fand, hence, in F, a = a! = a™*"" = qh = 
(oar = ON) = etm =b =b. 


Section 10.5 (page 364) 

1. (>) If f(*) isa unit in R[x], then f(x)g(x) = lg for some g(x). By Theorem 4.2, 
deg f(x) + deg g(x) = deg 1g = 0. Hence, deg f(x) = 0 = deg g(x), so that f(x), 
g(x) € R. Hence, f(x) is a unit in R. 

3. (=) Assume p isirreducible in R[x]. If p = rs in R, then either r or sis a unit in 
R[x]. Hence, r or sis a unit in R by Exercise 1. Therefore, p is irreducible in R by 
Theorem 10.1. 


5. Since cycz - + - Cm f(X) = g(x), each c; divides g(x). Therefore, c; is a unit in R 
because g(x) is primitive. 


9, First use the fact that R[x] is a UFD to show that R is an integral domain. If ¢ is 
a nonzero, nonunit element of R, then cisa nonzero, nonunit element of R[x] by 
Exercise 1. Hence, ¢ = p;p) ' '* Pr with each p; irreducible in R[x]. Theorem 4.2 shows 
that each p,;€ R. Hence, p; is irreducible in R by Exercise 3. Use the fact that R[x] is a 
UFD to show that this factorization is unique up to order and associates in R. 


Chapter 11 


Section 11.1 (page 374) 


7. a+ bi= (b —2a)i + a(l + 2i) + 0(1 + 39. Also, a + bi = (—2a)i + 
(a — b\(1 + 2i) + b(1 + 34). 
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9. Verify that ((—3/V2) — V3)V2 + V3(V2 + i) + V3(v3 - i) =0. 


11. If the subset is {Oys ty, uz,..., Upg then 1-0) + Opu, + Opu, +--- + O-u, = Oy, 
with the first coefficient nonzero. 


13. There exist c,€ F, not all zero, such that eyv, + ++: + cv, = Oy since the v; are 
linearly dependent. The set {t,..., Ug, Wy, - ., W} is linearly dependent because 
City ++ + uy + Open, + ++ + Opw, = 0, and not all the coefficients are zero. 


15. Foranyr + EC, r+ si = (;-3)s +(e + di). Hence, {b, c + di} spans C 


over R. Prove that it is also linearly independent over R. 


23. (a) If a + bV2 + cV3 =0, then a + bV2 = —cV3. Squaring both sides and 
rearranging, show that 2ab V2 = 3c? — a” — 2b’. If ab + 0, then V2 = 
(3c? -@- 2b*)/2ab €Q, which contradicts the fact that V2 is irrational. Hence, 
a = 0 or b =0. If a= 0, then bV2 + cV3 = 0. Square both sides and make a 
similar argument to show that bc = 0. Hence, b = 0 or c= 0. Buta = Qand b =0 
imply that c V3 = 0, whence, ¢ = 0. Similarly, a = 0 and c = 0 imply that b = 0. 

33. Suppose cyu; +--+ + cju, + dw = Oy. If d + Op, then w = d'enn — dot, — 
- ++ —d ou, a contradiction. Hence, d = Op. Then all the c; = Of because 
{uj ..., u,} is linearly independent. 

37. (© = (iii) Suppose S = {vı ..., Vag spans V over F. Then some subset T of S is 
a basis of V over F by Exercise 32. Since [V:F] =”, T must have n elements, and, 
hence, T = S. Use Exercise 36 to prove (ii) = (iii). (iii) implies (i) and (ii) by the 
definition of basis. 


Section 11.2 (page 381) 


3. Both F(u + c) and F(u) contain F by definition. Since c € F and u € F(u), 
u + ce F(u). Therefore, F(u) 3 F(u + c), since F(u + c) is the smallest subfield 
containing F and u + c. Conversely, u = (u + c) — ce F(u + c), so that 
F(u) & F(u + c), since F(u) is the smallest subfield containing F and u. 
Therefore, F(u + c) = F(u). 


5. (a) Verify that 3 + Si is a root of x -6x+34. {c) Verify that 1 + Visa 
root of x7 — 3x? + 3x — 3. 


7. By hypothesis, u is a root of some p(x) € F [x]. But F [x] & K[4], so that u is a root 
of p(x)e K[a]. 


9. VT isa root of 2 — t E Qmx]. 
11. 6. 


15. By the Factor Theorem, a + biis a root of f(x) = (x — (a + 5d) — (a — Bi). 
Verify that f(x) has real coefficients. 


17. (a) x4 — 22 — 4, 


21. m is a root of x* — q* € Q(m^[x] and, hence, is algebraic over Q(7*). Therefore, 
{1, T, 7’, T°} is a basis by Theorem 11.7. 


Section 11.3 (page 387) 


3. Many correct answers, including (a) {1, V5, i, V59 
(9 {1, V2, V3, V5, V6, V10, VIS, V30}. 
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5. Use Corollary 4.19 to show that 2? + 1 is irreducible over Q( V3) and thus is 
the minimal polynomial of i over Q( V3). Hence, [Q( V3, ) : Q(V3)] = 2 and 
[Q(V3, ’):Q] = (Q(-V3, HAV [Q(-V3):Q] = 2-2 = 4. 

7. [K(u):F] is finite by Theorems 11.7 and 11.4. Hence, u is algebraic over F by 
Theorem 11.9. If p(x) € F [4] is the minimal polynomial of u over F and 
q(x) € K[a] is the minimal polynomial of u over K, then g(x) | p(x) by Theorem 11.6. 
Hence, by Theorem 11.7, [K(u):K] = deg g(x) = deg p(x) = [Fu):F]. 

9. [F{u):F] and [K(u):F (u)] are finite by Theorems 11.4, 11.7, and 11.9 and 
Exercise 8. Apply Theorem 11.4 to FS F(u) S K(u). 


11. (a) Theorem 11.4 applied to FS F(u) & F(u, v) shows that m = deg p(x) = 
[F(u):F] divides [F(u, v):F]. Similarly, n | (F(u, v):F]. Hence, mn | [F(u, v):F] by 
Exercise 17 of Section 1.2. Use Theorem 11.4 and Exercise 7 to show that 
[F(u, v):F] = mn. Therefore, [F(u, v):F] = mn. 

13. Let A(x) E F(u)[x] be the minimal polynomial of v over F(u); then A(x) | q(x). By 
Exercise 1 l(a) and Theorems 11.4 and 11.7, (deg p(x)) (deg 9(x)) = [F(u, v):F] = 
[F(u, v):F(u)] [F(u):F] = (deg A(x))(deg p(x)). Therefore, deg A(x) = deg q(x), 
and, hence, g(x) = kh(x) for some k E K. Since h(x) in irreducible over Ffu), so is 
aA). 

15. If u is algebraic over E, then it is algebraic over F by Theorem 11.10 and 
Corollary 11.11. 


Section 11.4 (page 393) 
3. Q( V5, i) isa splitting field; it has dimension 4 by Exercise 3 of Section 11.3. 
7. The minimal polynomial p(x) of u is irreducible in F[x] and has a root in K. 
Therefore, p(x) splits over K = F(u). 


11. The fourth roots of —1 are (+ V2/2) + (V2/2)é, so that Q( V2, i) is a splitting 
field. 

15. x7 + 1 is irreducible in Z,[x] by Corollary 4.19. Hence, by Theorem 5.11, £,[x]/(x? + 1) 
isa field of nine elements that contains the roots [x] and [24] of 27 + 1. 

21. If p(x) € K[x]is irreducible and u is a root of p(x), then K(u) is algebraic over K 
by Theorem 11.10. Therefore, «is algebraic over Fby Corollary 11.11. Its minimal 
polynomial g(x) over F splits over K and divides the irreducible p(x) in K[a] by 
Theorem 11.6. Show that p(x) has degree 1 and apply Exercise 19. 


Section11.5 (page 397) 
1. Every polynomial in F[x] is also in Efx]. 


7. (a) If f(x) =a." +--+ + aand f(x) = 0p then for each k > 0, (k1 p) a, = ka, = Op 
Since Fhas characteristic 0, klp # Og and hence, a, = 0. Therefore, f(x) = a. 
9. If fx) and f'(x) are not relatively prime, then their ged has a root u in some 


splitting field. Hence, u is a repeated root of f(x) by Exercise 8, so that f(x) is not 
separable. 


13. Use the proof of Theorem 11.18, as in Example 2. 
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Section 11.6 (page 404) 
3. na=atat---+a=lpat leat -:++lea=(Irt+-:++ lye = (alpa = 
Oga = Op. 
5. Let p= characteristic F = characteristic K. F has order p”, where m = [F:Z,], by 
Theorem, 11.23, and, hence, g = p”. Since [K:Z,] = [K:F] [F:Z,] = nm, 
Theorem 11.23 shows that K has order p™ = g’. 
13. Every element a of Z, is a root of 3? — x by the proof of Theorem 11.25. Hence, 


a =ain Zp which means that @ = a (mod p) in Z. If ais relatively prime to p in 
Z, then a is a nonzero element of the field Z, and, hence, has an inverse. 


17. Since £ = F, each has order p” for some prime p. By Theorem 11.25, 
E = @,(u,,...,4,) = F, where the u are all the roots of x — xin K. 


Chapter 12 


Section 12.1 (page 413) 
L If a(o) = c for every c EF, then @(c) = a '(c(0)) = c. 
3. Use Theorem 11.7 to show that o(c) = c for all cE u). 
5. Use Corollary 12.5 and Lagrange’s Theorem 8.5. 

9. (a) p(x) =x? +x41 (b) GalgQ(w) = Z2. 

11. GalgQ(+V2, i) = Z; x Z}. 


Section 12.2 (page 421) 


1. The number of intermediate fields is the same as the number of subgroups of 
Gal,K, which is finite by Theorem 12.11. 


5. Four, of dimensions 10, 5, 2, and 1. 


9. (a) Every subgroup of Z, = Gal;X (in particular, Gal,;K) is cyclic and normal 
by Theorem 7.17. By Theorem 12.11, Galp£ = Gal-K/Gal;K; apply Exercise 24 
of Section 8.3. 


11. (b) [Q( ¥/2):Q] = 4 since af — 2 is irreducible in Q[x] by Eisenstein’s Criterion. 
af + 1 is the minimal polynomial of i over ac) by Corollary 4.19. 


Section 12.3 (page 431) 
1. (a) Many correct answers, including Q E Q( V5) 5 Q( V5, V7)S 


Q(V5, V7, Y2 + V5)5 Q( V5, V7, Y2 + V5, Y1 + V7). 
5. (a) Aq consists of the subgroup H and the eight 3-cycles (123), (132), (124), (142), 
(134), (143), (234), (243). Show that H is normal in A,. Use the fact that all 


groups of order <4 are abelian to show that the series S4 2 A, 2 H 2(1) satisfies 
the definition of solvability. 


7. (a) +1 (ce) l,i (e) +1, 1/2 + 1V3/2, -1/2 + iV3/2. 


13. If Kis the splitting field of a cubic polynomial, then [K:F] is divisible by 3 
(Why?) and <6 by Theorem 11.13. Hence, the Galois group is a subgroup of S, 
(Corollary 12.5) of order 3 or 6. 
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17. (a) xf- 4 +4 = gê -2y. Q(v2, a) is a splitting field, where w is a complex 
cube root of 1. G= S, (e) xf + 6x? + 9x = x(x? + 3, Q(i-V3) is a splitting 
field. G= 2. (e) G = Ss. 


Chapter 13 


Chapter 13 (page 441) 


1. If ka = 0 (mod p), then p [| ka. But (p, k)-= 1 (Why?). Hence, p | a by Theorem 1.5, 
which is a contradiction. 


3. (a}0107 0512 2421 1479. 


Chapter 14 


Section 14.1 (page 448) 


3. If there isa solution, then 0, 1, or 2 is a solution by Exercise 2. Verify that this is 
not the case. 


9. x = —30 (mod 187). 
1h. x = — 18 (mod 210). 
13. x = 204 (mod 204,204). 


19. (=) If b — a = dk and mu + nv = d, then muk + nvk = b — a. Proceed as in the 
proof of Lemma 14.1. 


Section 14.2 (page 452) 
3. 7 is (1, 2) and 8 is (2, 3) in Z, X #5, So the productis (1 - 2, 2 - 3) = (2, 1). 


5. (=) F f(r) = f(s), then both r and s are solutions of the system x = r (mod m;), 
x = r (mod m)), ...,x = r (mod m,). 


Section 14.3 (page 456) 


1. (a) Repeated use of Corollary 14.6 shows that both are isomorphic to 
ZX Z, X Z; and, hence, to each other. 


Chapter 18 


Chapter 15 (page 469) 


3. (a) Begin as in the construction of the coordinate plane. Place the compass 
point on (1, 0) and make a circle whose radius is the segment from (1, 0) to 
(3, 0). It intersects the vertical axis at Q. The right triangle with vertices (0, 0), Q, 
(1, 0) has hypotenuse of length 2 and one side of length 1. Hence the angle at Q 


1 
(opposite the side of length 1)is a 30° angle, because sin? 7 30%. 


(c) Part (a) shows that a 90° angle can be trisected. Since a 30° angle can be 
bisected, a 45° angle can be trisected. 


5. cos 3f = cos(t + 2t) = cos t cos 2t — sin t sin 22 = cos ¢(2 cos*t — 1) — 
sin (2 sin t cos f) = 2 cos*# — cos t — 2 sin’t cos t = 2 cost — cos t — 
2(1 — cos*#)cos t = 4 cost — 3 cos t. 


(Copyright 2012 Cangngp Lemming AN Rights Ramrend. May not ba copie, eran, or dapticanedl, in whale ar ia part Dus to electronic dight, mas third party contest may de mppamai fran the eBook eaVor eChagmar(s). Edixriai erview bas 
eee hat any capers costed! dows oct mammiaity fisc the crani larning apeiare Camgigy Lemming Oawn the right to cece atkfcionel costed! at eay time if athempuem riptes cemrictirms require it. 


Section 16.2 585 


7. No. To prove this, show that x must be the root of a cubic polynomial in Q[a] 
that has no rational roots. 


9. No. 
15. If Vk EF, then F( Vk) = F. If Vk ¢ F, then the multiplicative inverse of a 


nonzero element a + b Vk of F(vk) isc + dV&, where ¢ = aj (@ — kb) and 
d = —b/(® — ki). 


Chapter 16 


Section 16.1 (page 480) 
1. Verify that Cis closed under addition and, hence, is a subgroup by Theorem 7.12. 
3. (a) 1 (c) 4. 
5. (a) 0000, 1000, 0111,1111 (c) 0000, 0010, 0101, 0111, 1001, 1011, 1100, 1110. 
11. (c) If the ith coordinate is denoted by a subscript, then (u + w); = u, + w,and 
(v + w); = v; + w, Hence, (u + v); = (v + w) if and only if u; = v. 
17. Many correct answers, including 00000, 11100, 00111, 11011. 
21. n=5. 


25. Verify that an element of B(n) has even Hamming weight if and only if it is the 
sum of an even number of elements of Hamming weight 1 (for instance, 110 = 
100 + 010). Use this to show that the set of elements of even Hamming weight is 
closed under addition. 


27. (a) 96059601 (c) .00058806 (e) .00000001. 


Section 16.2 (page 490) 


1. (a) /0 0 {c) /1 
I, 1 1 
1 0 0 
0 1 l 
3. fi 5. Several possible answers, including 
1 
1 
1 
1 
1 


O O Om et ee e OOO mm m O 
O O m O m m m m mæ m a O OOO 
oroorr OOF KF Ow eK Om 
= O O O m CS Oe e O mim 


13. An error is detected if and only if w is not a codeword. Note that w = u + e and 
that the set of codewords is closed under addition. 
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Section 16.3 (page 497) 

1. (a) FFO) =a" +: HaT H a, then f(x) +f) = Gt age" + + 
(a; + ax" + «+= + (ao + ag) = 0a" + --- + Ox! + --- + O because a; + a, = 0 for 
every q; E 2). 

3. Verify that 1 + x + x* has no roots in Z, and, hence, no first- or third-degree 
factors. If there is a quadratic factor, it is either the product of two linear factors 
or irreducible. Use long division to show that the only irreducible quadratic 
(Exercise 2) is not a factor. 


5. (a) Use the table to show that o is a root of f(x) = 1 + x +32 + x? + 24 It 
then suffices to show that f(x) is irreducible. Use the method of Exercise 3. 

7. (© If flay + ax +--+ + a1") = (0,0, ..., 0, then [a + ax +--+ aX] 
= [0], so that the kernel of fis the identity subgroup: Apply Theorem 8.17. 

9. (a) D(x) = x? + ax + a has roots 1 = a and a = a!. Hence, the correct word 


is 000000000000000. (© D(x) = x7 + ax + a has roots a? and a”, Hence, 
the correct word is 101010010110000. 


Appendix B 


Appendix B (page 519) 

1. (a) {—2, — 1,0, 1, 2, 3, 4, 5, 6, 7, 8} (c) {1, 2}. 

3. (a) Empty since V2 is irrational (c) Empty. 

7. (a, 0), (, 1), (a, €), ©, 0), 1), ©, ©), (c 0), (6, D, (c, ©. 
11. (a) yes (c) yes. 


13. (a} Many correct answers, including the functions f, g, A, k given by f(1) = a, 


SC) = b, f(3) = c, F@) = a; g(1) = c, g(2) = b, g(3) = a, g(4) = b; hI) = b, 
h(2) = a, h(3) = c, h(4) = c k(1) = c, k) = a, kG) =a, k(4) =b. (€) There 
are six bijections from C to C. 


19. If (a, d)E A X (BU C), then aE A andde B or de C. Therefore, (a, d)E A X B 
or (a, d) E A X C, and, hence, (a, d) €(A X B)U (A X C). Thus A x (BU OE 
(A X B)U (4A X C). Conversely, suppose (r, s) E(A X B) U (A X C). Then (r, s)E 
A X Bor (r, s)E€ A X C. If (r, s)E A X B, then re A and sE B (and, hence, 
s EBU C), so that (r, 8) EA X (B U C). Similarly, if (r, s5)E€4 X C, then (r, s)E 
A X (BU C). Therefore, (A X B) U (A X C)EA X (BU C), and, hence, the two 
sets are equal. 


. No; why not? 
. (a) If f(a) = f (b), then 2a = 2b. Dividing both sides by 2 shows that a = b. 


Therefore, fis injective. (c) If f(a) = f(b), then a/7 = b/7, which implies 
thata = b. 


27. (a) If (g ° f )(a) = (g ° f Xb), then g(f(a)) = g(f(b)). Since g is injective, f(a) = 
f(b). This implies that a = b because f is injective. Therefore, g ° fis injective. 


29. (a) Let dE D. Since g ef is surjective, there exists b € B such that (g ° f)(b) = d. 
Let c = f(b) € C. Then g(c) = g(f(5)) = (g ° f )(b) = d. Hence, g is surjective. 


23 
25 
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Appendix C 


Appendix C (page 528) 


1. P(O)is true since 0 = 0(0 + 1)/2. If P(k) is true, then 1 +2 + - -- +k =k +1/2. 
Add k + 1 to both sides and show that the right side is (k + 1)(k + 2)/2. This 
says that P(k + 1) is true. 


3. Let P(n) be the statement 2"~! < n!. Verify that P(0) and P(1) are true. If P(x) is 
true and k = 1, then 2! = k! and 2 =k + 1. Hence, (2*-')2-= Ak + 1), that is, 
2*<(k + 1)Ì. Thus Pk + 1) istrue 

7. Verify that the statement is true when z = 1. Suppose the statement is true for k, that 
is that 3 is a factor of 2+! + 1. Then 2**! + 1 = 3t, and, hence, 2**! = 3¢— 1. To 
show that the statement is true fork + 1, note that 2¢+)+! = 22+2+1 _ g2k+192 _ 
Bt — 1)4 = 12t — 4 = 3(4t — 1) — 1, and, hence, 24+0+! + 1 = 3(4t — 1). 


11. Verify that the statement is true when n = 1. Let B = {b;, b2,..., bn}. In defining 
an injective function from B to B, there are n possible choices for the image of b,, 
n — 1 choices for the image of b, (because b, can’t have the same image as by), 
n — 3 choices for the image of 53, and so on. 


13. (a) Verify that the statement is true when n = 2. Assume that a set of k elements 
has k(k — 1)/2 two-element subsets and that B has k + 1 elements. Choose b € B 
and let C= B — {b}. Every two-element subset of B consists either of two 
elements of C or of b and one element of C. There are k(k — 1)/2 subsets of the 
first type by the induction hypothesis. 


Appendix D 


Appendix D (page 534) 


3. (a) a ~ asince cos a = cosa. If a ~ b, then cosa = cos b and, by the symmetric 
property of =, cos 6 = cosa; hence, b ~ a. If a ~ band b ~ e, then cos a = cos b 
and cos b = cos ¢. Hence, cos a = cos c, and, therefore, a ~ c. 


5. (b) The equivalence class of (r, s) is the vertical line through (r, s). 
9. (a) Transitive (c) Symmetric. 
19. (b) Consider the subgroup K = {ro v} of Dy. 


Appendix E 


Appendix E (page 539) 
1. 4032. 


s Gea pea ln) 
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Appendix F 


Appendix F (page 543) 
i, Owes as a 
2 > © ii iy 


3. (a) The entry in position +j of 4 + Bis a; + by But a, + by = b; + d which is 
the entry in position i-j of B + A. Hence, A + B= B+ A. 


Appendix C 


Appendix G (page 551) 
1. (a)xt+x?+x° (0) (-11, 7.5, —3, 12, —5, 0, 3,0, 0, 0, . . .). 
3. (a) [(@y a -- -)® (bos bis . - J] @ (co ĉr- - -) 

= (a + by, a; + bi . . .) ® (eg, cn- -) 

= (ao + bo) + œ (ai + bi) + 0, ...) 

= (aa + (bo + co), 2) + (bi + €), - - -) 

= (Ap, a, .. .) ® (bo + co bi +, .--) 

= (av, 4, . - -) ® [bo br- - -) ® (os €n - J] 
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algebraic, 393 
code, 437, 471 
BCH, 492 
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block, 473 
corrects errors, 475 
cyclic, 498 
decoding techniques, 483 
detects errors, 476 
generator matrix, 478 
generator polynomial, 494 
group, 473 
Hamming, 477, 490 
linear, 471, 475 
pad, 437 
parity-check, 473 
secret, 437 
systematic, 477 
codeword, 473 
coding theory, 471 
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binomial, 537 
leading, 88 
polynomial, 86, 545 
column vector, 541 
combination, linear, 11, 367 
common divisor, 10 
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laws, 34, 35, 44 
ring, 44, 162 
commutator subgroup, 262 
compass, 459 
complement, relative, 511 
complete induction, 525 
complex 
conjugation, 74, 408, 429 
numbers, 49, 178, 191 
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composite function, 512 
composition factor, 269 
compound statement, 500 
computer arithmetic, 450 
conclusion, 503 
conditional statement, 503 
congruence(s), 25, 125, 145, 237, 443 
class, 25, 126, 147, 239 
class arithmetic, 130 
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ideals and, 141 
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74, 408, 429 
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constructible 

angle, 468 

circle, 461 
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point, 460, 461 
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method of proof, 507 
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contradiction, proof by, 506 
contrapositive, 503 

method of proof, 506 
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Galois, 415, 420 

one-to-one, 517 
coset, 147, 239 

decoding, 483 

leader, 483 

left, 248 

right, 239, 255 
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cryptography, 437 
cube, duplication of, 459, 468 
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disjoint, 229 

of length k, 228 
cyclic group, 206, 293 
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coset, 483 
maximum-likelihood, 472 
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DeMoivre’s Theorem, 426 

dependent, linearly, 368 

derivative, 395 

descending chain condition, 343 

designed distance, 494 

determinant, 225 

diagonal, main, 50 


dihedral group, 176, 190, 314 
dimension, 371 
direct 
factor, 284 
method of proof, 505 
product, 281, 288 
sum, 281, 288, 293 
disjoint 
coagruence classes, 29 
cycles, 229 
sets, 511 
distance, Hamming, 474 
distributive laws, 35, 44 
divides, 9, 96, 322 
divisibility, 9 
in F[x], 95, 125 
division algorithm, 4, 9, 526 
in F[x], 90 
division ring, 58 
divisor, 4, 9 
common, 10 
elementary, 295 
greatest common, 10, 16, 96, 
326, 340 
zero, 41, 64 
domain 
Euclidean, 322, 323 
of function, 512 
integral, 48, 65, 321 
principal ideal, 332 
unique factorization, 328, 336 
dream, freshman’s, 402 
duplication of the cube, 459, 468 
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algebraic, 376 
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image of, 516 
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of set, 509 
transcendental, 376, 550 
elementary divisor, 295 
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equal functions, 513 
equality of sets, 510 
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class, 357, 533 
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even permutation, 231 
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in rings, 62 
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algebraic, 382 
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finite dimensional, 371, 372 
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Galois, 417 
of isomorphism, 379, 380 
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quadratic, 464 
radical, 424 
ring, 550 
separable, 394 
simple, 376 
external direct product, 284 


F, 324 
F*, 366, 371 
Fix], 85, 125 
F{[x]ip(x), 130, 135, 376 
¥-automorphism, 408 
factor, 9, 96, 322 
composition, 269 
direct, 284 
group, 255 
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theorem, 107 
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domain, 336, 359 
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unique, 17, 100, 328, 336, 
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Last Theorem, 345 
Little Theorem, 212, 405, 438 
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algebraically closed, 
120, 392 
automorphism, 408 
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finitely generated 
extension, 383 
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Galois, 404 
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intermediate, 412, 420 
normal extension, 391 
prime subfield, 401 
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of quotients, 353, 358 
radical extension, 424 
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Toot, 388 
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abelian groups, 289 
dimensional, 371, 372 
field, 399 
group, 172, 186, 198, 281 
group structure, 242, 312 
order, 172, 186, 198 
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group, 262 
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fixed field, 412 
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inverse, 519 
one-to-one, 515 
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polynomial, 105 
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rational, 358 
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value of, 512 
Fundamental Theorem of 
Algebra, 123 
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Finite Abelian Groups, 293 
Galois Theory, 415, 418 
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GIN, 255 
@p), 290 
Gal;K, 408 
Galois, E., 407, 415 
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correspondence, 415, 420 
Criterion, 426, 428 
extension, 417 
field, 404 
Fundamental Theorem of, 
415, 418 
group, 407 
group of a polynomial, 426 
theory, 407 
Gauss, C. F., 345 
Gauss’s Lemma, 362 
Gaussian integers, 322 
ged, 10, 16, 96, 326, 340 
general linear group, 179, 194 
generator 
of a group, 209 
matrix, 478 
polynomial, 494 
geometric constructions, 459 


greatest common divisor, 10, 16, 96, 


326, 340 
group(s), 169 
abelian, 172, 186, 191, 289 
additive notation, 198, 207, 
238, 289 
alternating, 227, 230, 273 
automorphism, 218 
basic properties of, 196 
Cartesian product, 180, 195, 281 
Cauchy’s Theorem, 297, 299, 307 
Cayley’s Theorem, 221, 273 
center, 205 
classification of, 281, 295, 318 
code, 473 
congruence, 237 
conjugacy, 304 
coset, 239, 248 
cyclic, 206, 293 
defined, 172, 186 
definition and examples, 169, 183 
dihedral, 176, 190, 314 
direct product, 281 
factor, 255 
finite, 172, 186, 198, 242, 281 
finite abelian, 289 
finite nonabelian, 298 
finite, structure of, 242, 312 
finitely generated, 262 
Fundamental Theorem of Finite 
Abelian, 293 
Galois, 407 
general linear, 179, 194 
generator, 209 
homomorphism, 220, 263 
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indecomposable, 288 

inner automorphism of, 219 

isomorphic, 214, 216 

Isomorphism Theorems, 266, 
267, 272 

metabelian, 273 


multiplicative notation, 196, 198, 


238, 289 
nilpotent, 303 
normalizer of, 213, 308 
order of, 172, 186, 198, 318 
p>, 291, 312 
permutation, 169, 222, 231 
quaternion, 181 
quotient, 255, 263 
Tepresentation, 222 
tings and, 177, 237 
simple, 268, 273 
of small order, 316 
solvable, 424 
special linear, 182 
structure of, 242, 259, 312 
subgroup, 203, 237 
Sylow Theorems, 298 
symmetric, 173, 187, 227 
torsion, 298 
of units, 179 


Hamming, R. W., 471 
Hamming 

code, 477, 490 

distance, 474 

weight, 474 
homomorphic image, 77, 157 
homomorphism 

evaluation, 111 

of groups, 220, 263 

kernel, 154, 263 

natural, 156 

of rings, 75, 154 
Hungerford, John W., 592 
hypothesis, 503 

induction, 524 


ideal(s), 141 

ascending chain condition, 
334, 342 

congruence modulo an, 152 

descending chain 
condition, 343 

finitely generated, 144 

left, 143 

maximal, 164 

prime, 162 

principal, 144, 150 

product of, 150 

tight, 148 

ting, 141 

sum of, 149 

unique factorization of, 349 


idempotent, 66 
identity 

additive, 34, 44 

element, 172, 186 

map, 218, 512 

matrix, 48, 194, 540 

multiplicative, 35, 44 

Ting with, 44 
image 

of element, 516 

of function, 512, 517 

homomorphic, 77, 157 
impossibility proofs, 461, 467 
indecomposable group, 288 
independent, linearly, 368 
indeterminate, 87, 550 
index 

set, 511 

of subgroup, 240 
induction, 507, 523 

assumption, 524 

complete, 525 

hypothesis, 524 

principle of, 524 
infinite 

dimensional, 371 

direct product, 288 

direct sum, 288 

order, 172, 186, 199 
injective function, 515 
Inn G, 225 
inner automorphism, 219 
instructor, to the, xii—xiii 
integer, 3, 191 

algebraic, 350 

composite, 19 

Gaussian, 322 

prime, 17 

quadratic, 344, 351 

square-free, 346 
integral domain, 48, 65, 321 

arithmetic in, 321 

field of quotients, 353 
interdependence of 

chapters, xiii 

intermediate fields, 412, 420 

conjugate, 422 
internal direct product, 284 
intersection (of sets), 511 
invariant factor, 295 
inverse, 40, 172, 186 

of a cycle, 274 

function, 519 

multiplicative, 63 
invertible matrix, 64 
irreducibility 

in C[x], 120 

of p(x), 135 

in Q[a], 112 

in Rij, 120 


irreducible 

element, 323 

polynomial, 100, 101, 135 
isomorphic 

fields, 379, 380 

groups, 216, 243, 295 

rings, 70, 72 
isomorphism 

extension of, 379, 380 

of groups, 214, 243, 266 

preserved by, 79 

of rings, 70, 78, 157 

theorems, 157, 161, 266, 267, 272 


k-cycle, 228 

kernel, 154, 263 
Kronecker delta, 485, 541 
Kummer, E., 345, 349 


Lagrange’s Theorem, 240, 241 
Lame, G., 345 
Icm, 16, 344 
leading coefficient, 88 
least 
common multiple, 16, 344 
residue, 439 
Leep, David, xi 
left 
coset, 248 
ideal, 143 
regular representation, 222 
length 
of BCH code, 494 
of cycle, 228 
line, constructible, 461 
linear 
code, 471, 475 
combination, 11, 367 
congruences, 443 
group, 179, 182, 194 
independence, 368 
linearly (in)dependent, 368 
local ring, 167 
logic, 500 


MIC), M(Q), MIZ), M(Z,), 48 
M(R), 46 
main diagonal, 50 
map, 512 
identity, 218, 512 
zero, 75 
Marks, Greg, xi 
mathematical induction, 524 
matrix, 46, 540 
addition, 47, 541 
algebra, 540 
equal, 46 
identity, 48, 540 
invertible, 64 
main diagonal, 50 
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multiplication, 47, 542 
parity-check, 484 
product, 542 

ring, 46, 543 

scalar, 57 

standard generator, 478 
sum, 541 

zero, 47, 540 


order, 291 
maximum-likelihood decoding, 472 
McBrien, Vincent O., iii, 402 
member of set, 509 
message word, 472, 473 
metabelian group, 273 
methods of proof, 505 
minimal polynomial, 378 
modular arithmetic, 32 
modus ponens, S05 
monic polynomial, 96 
multiconditional statement, 508 

proof of, 507 
multiple root, 111 
multiplication 

congruence class, 32, 130 

polynomial, 88, 546 

scalar, 366 

in Z, 35 

in Z,, 32 
multiplicative 

identity, 35, 44 

inverse, 63 

notation, 196, 198, 238, 289 


Ry, 513, 516, 523 
natural homomorphism, 156 
nearest-neighbor decoding, 
475 
negation, 501 
negative, 60 
nilpotent 
element, 70 
group, 303 
norm, 346 
normal 
extension, 391 
subgroup, 213, 248 
normalizer, 213, 308 
notation 
additive, 198, 207, 238, 289 
congruence, 25, 238 
multiplicative, 196, 198, 
238, 289 
set-builder, 509 
translating between, 198, 207, 
238, 289 
nth root, 423, 426 
of unity, 426 
null set, 510 


number(s) 
algebraic, 386 
complex, 49, 178, 191 
constructible, 461 


odd permutation, 231 
one-to-one 
correspondence, 517 
function, 515 
onto function, 516 
operation, 511, 514 
Oprea, John, xi 
order 
of element, 198, 401 
of group, 172, 186 
maximal, 291 
in Z,, 3 


p-group, 291, 312 
parity-check 
code, 473 
matrix, 484 
matrix decoding, 489 
partition, 534 
Pascal’s triangle, 539 
permutation(s), 169, 184, 222 
of a set T, 170, 184 
even, 231 
odd, 231 
PID, 332 
point, constructible, 460, 461 
polygon, regular, 314 
polynomial(s), 85, 545 
addition, 88, 546 
associate, 100 
constant, 89 
degree of, 88 
derivative of, 395 
divisibility, 95 
division algorithm for, 90 
equal, 546 
equations of fifth degree, 428 
error-locator, 495 
function, 105 
Galois group of, 426 
generator, 494 
irreducible, 100, 101, 135 
leading coefficient, 88 
minimal, 378 
monic, 96 
multiplication, 88, 546 
primitive, 360 
reducible, 101 
relatively prime, 99 
ring, 125, 545 
root of, 106, 111, 394, 461, 466 
separable, 394 
positive common divisor, 326 
premise, 503 
preserved by isomorphism, 79 
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primality testing, 21 
prime, 17 
ideal, 162 
integer, 17 
relatively, 10, 99, 328 
subfield, 401 
primitive 
nth root of unity, 426 
polynomial, 360 
principal ideal(s), 144, 150 
ascending chain condition on, 334 
domain, 332 
principle 
of complete induction, 525 
of mathematical induction, 524 
product 
Cartesian, 51, 180, 195, 281, 512 
direct, 281 
of ideals, 150 
infinite direct, 288 
of matrices, 542 
semidirect, 288 
proof, 504 
for beginners, ix 
completion symbol for, 7 
by contradiction, 506 
impossibility, 461, 467 
methods of, 505 
techniques, 39 
proper 
subgroup, 203 
subset, 510 
public-key cryptography, 437 
public-key system, 438 


Q, 49, 178, 191-192 
Q, 181, 316 
QZ, 259 
Q[x], 112 
Qx], 336 
quadratic 
equation in Z, 36 
extension chain, 465 
extension field, 464 
formula, 114 
integer, 344, 351 
quantifiers, 502 
quaternion(s) 
division ring of, 58 
group, 181, 316 
real, 58 
quotients, field of, 353, 358 
quotient groups, 255, 263 
subgroups of, 267 
quotient rings, 152, 154, 162 


R, 45, 49, 178, 191, 263 
RII, 154, 162 

R[x], 120 

Rix], 86 
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tadical(s) 
extension, 424 
solvability by, 423 
range, 512 
rational 
function, 358 
numbers, 178, 191 
toot test, 113 
real numbers, 178, 191, 263 
real quaternions, 58 
received word, 472, 473 
reducible polynomial, 101 
reflexive, 26, 126, 146, 
239, 531 
relation, 531 
equivalence, 531 
relative complement, 511 
relatively prime, 10, 99, 328 
remainder, 4 
theorem, 107 
repeated root, 394 
representation, 222 
left regular, 222 
right regular, 226 
residue 
class, 126 
least, 439 
right 
annihilator of a, 57 
congruence modulo a 
subgroup, 238 
coset, 239, 255 
ideal, 148 
regular representation, 226 
Ting(s), 44 
arithmetic in, 59 
basic properties, 59 
Boolean, 69 
Cartesian product of, 51 
center, 57 
characteristic of, 70, 399 
Chinese Remainder Theorem 
for, 453 
commutative, 44, 162 
congruence-class, 125 
division, 58 
extension, 550 
of Gaussian integers, 322 
homomorphism, 75, 154 
with identity, 44 
isomorphic, 70 
local, 167 
matrix, 46, 543 
polynomial, 86, 545 
quaternion, 58 
quotient, 152, 162 
subtraction in, 60 
units, 63 
zero divisors, 64 
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Rivest, R., 438 
root, 106 
adjoining a, 379 
field, 388 
multiple, 111 
nth, 423, 426 
rational, 87, 113 
rational root test, 113 
Tepeated, 394 
of unity, 426 
row vector, 541 
RSA code system, 438 
Ruffini, P., 407, 423 
ruler and compaas, 459 


Sa 172 
scalar matrix, 57 
scalar multiplication, 366 
Second Isomorphism Theorem 
for groups, 267, 272 
for rings, 161 
Second Sylow Theorem, 
300, 309 
semidirect product, 288 
separable/separability, 394 
set(s), 509 
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